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TITLE 

Fusion Proteins Comprising Tumor Necrosis Factor Receptor 
RArKGRnUND OF THE INVENTION 



A number of cytokines are known to bind to specific receptor proteins on the 
surface of target cells. Among the specific receptor proteins that have been identified 

10 are tumor necrosis factor receptors and interleukin-l receptors. Much effon is being 
directed toward isolation and characterization of a number of receptors in order to study 
their physiological roles and to explore possible therapeutic uses. The binding of a 
particular target molecule by a soluble receptor administered to a patient may alleviate 
disorders mediated by the target molecule. 

15 Tumor necrosis factor-a (TNFo, also known as cachectin) and tumor necrosis 

factor-p (TNFp, also known as lymphotoxin) are homologous mammalian endogenous 
secretor>^ proteins capable of inducing a wide variet>' of effects on a large number of 
cell types. The great similarities in the structural and functional characteristics of these 
two cytokines have resulted in their collective description as "TNF/' Complementary 

20 cDNA clones encoding TNFa (Pennica et al., Nature 572:724, 1984) and TNFp (Gray 
et al., Nature 572:721, 1984) have been isolated, permitting further structural and 
biological characterization of TNF. 

TNF proteins initiate their biological effect on cells by binding to specific TNF 
receptor (TNF-R) proteins expressed on the plasma membrane of a TNF-responsive 

25 cell. TNTa and TNFp were first shown to bind to a common receptor on the human 
cervical carcinoma cell Hne ME-180 (Aggarwal et al., Nature 575:665, 1985). 
Hohmann et al. {J, Biol, Chem, 264:14927, 1989) reponed that at least two different 
cell surface receptors for TNF exist on different cell types, although the relationship 
beuveen these TNF-Rs is unclear. These receptors have an apparent molecular mass of 

30 about 75 - 80 kDa and about 55-60 kDa, respectively. In addition to cell surface 
receptors for TNT, soluble proteins from human urine capable of binding TNF have 
also been identified (Peetre et al., Eur. J, Haematol 47:414, 1988; Seckinger et al., J, 
Exp. Med, 767:1511, 1988; Seckinger et al., J. Biol. Chem, 264:11966, 1989; UK 
Patent Application, Publ. No. 2 218 101 A to Seckinger et al.; Engelmann et al., 7. 

35 Biol. Chem. 264:1 1974, 1989). 
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Interleukin-la (IL-la) and interleukin-lB (IL-1I5) are distantly related 
polypeptide hormones that play a central role in the regulation of immune and 
inflammatory responses. These two proteins act on a variety of cell types and have 
multiple biological activities. The biological activities ascribed to IL-la and IL-IB are 
5 mediated via at least two classes of plasma membrane bound receptors which bind both 
IL-la and IL-IB. The IL-1 receptors expressed on B cells (referred to herein as type II 
IL-1 receptors) are different from IL-1 receptors detected on T cells and other cell types 
(refeired to herein as type I IL-1 receptors). 



10 SUMMARY OF THE TNVFA-TTnNJ 

The present invention is directed to receptors comprising a first tumor necrosis 
factor receptor (TNF-R) polypeptide covaiently linked to a second TNF-R polypeptide. 
Alternatively, the receptor comprises one or two TNF-R polypeptides covaiently linked 

15 to one or two interleukin-1 receptor (TL-IR) polypeptides. 

The receptors preferably are produced as fusion proteins via lecombinant DNA 
technology. The present invention provides fusion proteins comprising, as one of at 
least two biologically active polypeptide components, a TNF-R polypeptide. One 
fusion protein of the present invention comprises two TNF-R polypeptides, preferably 

20 joined via a peptide linker. 

In another embodiment of the invention, the fusion protein comprises TNF-R 
and IL-IR. The fusion protein preferably comprises two TNF-R polypeptides and 
either one or two IL-IR polypeptides. 

The present invention also provides isolated DNA sequences encoding the 

25 fusion proteins, recombinant expression vectors comprising such DNA sequences, host 
cells containing the expression vectors, and processes for producing the recombinant 
fusion proteins by culturing the host cells. Pharmaceutical compositions comprising a 
purified fusion protein as described above and a suitable diluent, carrier, or excipient 
are also provided by the present invention. Such compositions are useful in therapy, 

30 diagnosis, and assays for conditions mediated by mmor necrosis factor or inrerieukin- 
L 



BRIEF DESCRTPTTON OF T HE DRAWTNOS; 



35 



Figure 1 presents a restriction map for a type I human IL-IR cDNA clone. The 
sites at which certain restriction enzymes cleave the cDNA are shown. 
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Figures 2A-2B depict the panial cDNA sequence and derived amino acid 
sequence of a human TNF-R clone. Nucleotides are numbered from the beginning of 
the 5' untranslated region. Amino acids are numbered from the beginning of the signal 
peptide sequence. The apparent signal peptide sequence is represented by the amino 
acids -22 to -L The N-terminal leucine of the mature TNF-R protein is underlined at 
position 1. The apparent transmembrane region from amino acids 236 to 265 is also 
underlined. The C-termini of various soluble TNT-Rs are marked with an arrow (C). 
Cleavage sites for certain restriction endonucleases employed in constructing 
expression vectors are indicated- 
Figure 3 depicts a plasmid vector comprising a DNA fragment encoding a 
fusion protein of the formula TNT-R-Iinker-TNF-R, constructed as described in 
example 11. 

Figure 4 depicts a plasmid vector comprising a DNA fragment encoding a 
fusion protein of the formula IL-lR4inker-TNF-R-linker-TNF-R, constructed as 
15 described in example 12. 

Figure 5 depicts three plasmid vectors that are intermediates in the construction 
of cenain vectors of the present invention, as described in example 12. 



DETAILED DESCRTPTTO N OF THF TNVFKTfnfw 

20 

The present invention is directed to receptors comprising a first TNF-R 
polypeptide covalently linked to a second TNF-R polypeptide. Alternatively, the 
receptor comprises one or two TNF-R polypeptides covalently linked to one or two IL- 
IR polypeptides. The TNT-R and (when present) IL-IR polypeptide components may 

25 be attached to one another using any suitable technique for attaching one polypeptide to 
another. Cross-linking reagents and peptide linkers are among the linkage methods that 
may be employed. The TNF-R and IL-IR polypeptides are derived from mammalian 
species, preferably human. 

The receptors preferably are produced as fusion proteins via recombinant DNA 

30 technology. The present invention provides fusion proteins comprising, as one 
component, a mammalian tumor necrosis factor receptor (TNF-R). One fusion protein 
of the present invention comprises two TNT-R polypeptides and may be represented by 
the following formula: 



35 



TNT-R - linker - TNF-R 
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wherein the linker is a peptide linker. 

In another embcxiinient of the invention, the fusion protein comprises TNF-R 
and a mammalian interleukin-l receptor (IL-IR). In this embodiment of the invention, 
the fusion protein may comprise one TNF-R polypeptide and one IL-IR polypeptide. 
5 Preferably, two TNF-R polypeptides and one IL-IR polypeptide are joined to fomi the 
fusion protein. The two TNF-R polypeptides preferably are adjacent to one another (as 
opposed to IL-IR being positioned between the two TNF-Rs) to enhance binding of 
TNF. Examples of such fusion proteins are represented by the following foraiulas: 

10 TNF-R -linker -IL-IR 

IL-IR -linker -TNF-R 

TNF-R - linker - TNF-R - linker - IL-IR and 

15 

IL-IR - linker - TNF-R - linker - TNF-R 



wherein each hnker is a peptide linker. The N-terminus of each fusion protein is on the 
left side of each formula. 

20 Each TNF-R polypeptide component of the fusion proteins is independentiy 

capable of binding mmor necrosis factor (TNF). Likewise, each EL-IR polypeptide 
employed in the fusion proteins is independentiy capable of binding interleukin-l (IL- 
1). Including two adjacent TNF-R polypeptides in the fusion protein is advantageous 
in that the TNF binding affinity is increased compared to ±c binding of TNF by a 

25 single TNF-R polypeptide. 

Peptide linkers that may be employed in die present invention separate TNF-R 
polypeptides (and IL-IR polypeptides, when present) from one another by a distance 
sufficient to ensure that each polypeptide properly folds into the secondary and tertiary 
structures necessary for the desired biological activity. The linker also should allow the 

30 extracellular domains of tiie TNF-R and IL-IR polypeptides to assume the proper 
spatial orientation to form a binding site for TNF or IL-1. The peptide linkers function 
as spacers, as opposed to the pharmaceuticaUy active TNF-R and IL-IR polypeptide 
components of the fusion proteins. Suitable polypeptide Imkers preferably (1) will 
adopt a flexible extended conformation, (2) will not exhibit a propensity for developing 

35 an ordered secondary structure which could interact with the functional protein 
domains, and (3) will have minimal hydrophobic or charged character which could 
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promote interaction with the functional protein domains. TNpical siuface amino acids in 
flexible protein regions include glycine (Gly), asparagine (Asn) and serine (Ser). 
Vinually any permutation of amino acid sequences containing Gly, Asn and Ser would 
be expected to satisfy the above criteria for a peptide linker sequence. Other near 
5 neutral amino acids, such as threonine (Thr) and alanine (Ala), may also be used in the 
linker sequence. Suitable peptide linkers generally comprise a chain of amino acids, 
preferably from 5 to 100 amino acids in length and most preferably from 10 to 20 
amino acids in length. Examples of such linkers include, but are not limited to, 
(Gly4Ser)n, wherein n is 1-12, Gly4SerGly5Ser, and (Gly4SerGly5Ser)2. 

TNF-R and IL-IR Polypeptides 

As used herein, the terms "interieukin-1 receptor" and "IL-IR" refer to proteins 
which are capable of binding interieukin-1 (IL-1) molecules and, in their native 
configuration as human plasma membrane proteins, play a role in transducing the signal 

15 provided by IL-1 to a cell. As used herein, the terms "TNF receptor" and "TNF-R" 
refer to proteins that are biologically active in that they are capable of binding tumor 
necrosis factor (TNTF). Native membrane bound forms of the proteins also transduce a 
biological signal initiated by a TNF molecule binding to a cell. For fusion proteins 
comprising more than one TNF-R polypeptide, the TNF-R polypeptides may be 

20 identical or different Likewise for IL-IR polypeptides. 

Intact receptors generally include an extracellular domain which binds to a 
hgand, a hydrophobic transmembrane domain which remains embedded witiiin the 
plasma membrane lipid bilayer, and a cytoplasmic or intracellular domain which is 
believed to deliver a biological signal to effector cells via a cascade of chemical 

25 reactions within the cytoplasm of the cell. The hydrophobic transmembrane domain 
and a highly charged region of the cytoplasmic domain immediately downstream of the 
transmembrane domain cooperatively function to halt transport of the EL-l and TNF 
receptors across the plasma membrane. The extracellular domain of the TNF-R and IL- 
IR proteins disclosed herein is the N-terminal portion of the protein, from amino acid 1 

30 to the amino acid immediately preceding tiie transmembrane region. The cytoplasmic 
domain is that portion of the protein that is located downstream of the transmembrane 



Among \ht TNT-R polypeptides that may be employed as components of the 
mventive fusion proteins is a pol\7)epnde comprising amino acids 1 to 439 of the 
sequence presented in figures 2A-2B, which is a full length native TNF-R sequence. 
This TNT-R DNA and amino acid sequence is also presented in SEQ ID NOS: 1 and 2. 
When the signal sequence is desired, the TNT-R poh'peptide may comprise amino 



region. 
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acids -22 to 439 of the figures 2A-2B sequence. The desirability of including the signal 
sequence depends on such factors as the position of the TNF-R polypeptide in the 
fusion protein and whether the intended host cells will process a mammalian signal 
sequence, as discussed below. The mature full-length native glycosylated form of this 
5 human TNF-R is a glycoprotein having a molecular weight of about 80 kilodaltons 
(kDa). As used throughout the specification, the term "mature" means a protein lacking 
a leader or signal sequence as may be present in full-length transcripts of a native gene. 
A protein may comprise a signal sequence when initially expressed. Cleavage of the 
signal sequence tipon secretion of the protein from the cell yields the mature form of the 
10 protein. 

Odier suitable TNF-R polypeptides are described in European patent application 
publication number 422,339 (EP 422,339 hereinafter), which is hereby incorporated by 
reference in its entirety. Two suitable TNF-R polypeptides comprise arginine (Arg) as 
residue 174 but are otherwise identical to the above-described polypeptides comprising 

15 amino acids 1 to 439 or -22 to 439, respectively, of figures 2A-2B of tiie present 
application. A TNF-R a m i n o acid sequence identical to that of figures 2A-2B (of the 
present application) except for substitution of arginine (Arg) for methionine (Met) at 
position 174 of the mature sequence is disclosed in EP 422,339 (see figure 39 therein). 
The cDNA and encoded amino acid sequences of another useful TNF-R 

20 polypeptide are disclosed in figure 21 of EP 422,339. The coding region of the EP 
422,339 figure 21 cDNA sequence, and the amino acid sequence encoded thereby, are 
presented as SEQ ID NOS:3 and 4 of the present application. Although referred to as a 
30 kilodalton protein in EP 422,339, other molecular weights have been reported for 
this protein. A molecular weight of about 55 kilodaltons was reported by Loetscher et 

25 al. {Cell 61:351, 1990) and in EP 417,563, for example. Useful TNF-R polypeptides 
include tiiose comprising amino acids 1 to 415 of tiie SEQ ID N0:4 sequence, or, when 
the signal sequence is desired, amino acids -40 to 415 of the SEQ ID N0:4 sequence. 
Metiiods for producing this TNF-R protein, eitiier by purification fi-om urine or fi-om 
the medium of a culture of U937 ceUs, or by recombinant DNA technology, are 

30 described in EP 422,339. This TNF-R is characterized by an N-terminal sequence (for 
the mature form of the protein) of Asp-Ser-Val-Cys-Pro-Gln- , whereas the N-temiinal 
sequence of tiie mamre form of tiie TNF-R protein of figures 2A-2C is Leu-Pro-Ala- 
Ghi-Val-Ala- . 

In certain embodiments of the present invention, the TNF-R polypeptide is a 
35 soluble Tf^-R polypeptide. Soluble TNF-R polypeptides lack at least part (preferably 
all) of the transmembrane region tiiat promotes retention of tiie protein on the cell 
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surface. The soluble polypeptides generally also lack the charged region of the 
cyioplasniic domain (located immediately downstream of the transmembrane region) 
that contributes to retention on the cell surface. Preferably, the entire transmembrane 
region and cytoplasmic domain are deleted from the protein or substituted with 
5 hydrophilic amino acids to forai soluble TNF-R, Soluble TNF-R is secreted from the 
cell and retains the desired biological acti\at\'. 

Examples of soluble TNF-R polj-peptides are those comprising amino acids 1 - 
X of the Figure 2A sequence, wherein x is the C-terminal amino acid and is selected 
from the group consisting of any one of amino acids 163 - 235 of Figtire 2A. Specific 

10 examples include polypeptides comprising the acids 1 - 163, 1 - 185, or 1 - 235 of 
Figure 2A. The soluble TNF-R pohpeptide may additionally comprise a signal 
sequence, e.g., amino acids -22 to -1 of Figure 2A. 

Additional examples of soluble TNT-R polypeptides are those comprising 
amino acids 1-184 or 1-182, -22-184, or -22-182 of the figures 2A-2B sequence. 

15 Such proteins may contain either methionine or arginine at position 174. Procedures 
for preparing examples of such TNF-R polypeptides include those described in 
examples 17 and 22 of EP 422,339. 

The TNF-R protein shown in SEQ ID N0:4 comprises a signal peptide 
(designated amino acids -40 to -1) and a transmembrane region beginning with the 

20 valine residue at position 172. Preferred soluble forms of this TNF-R protein include 
those comprising amino acids -40- w or 1-w of SEQ ID NO:4, wherein w is an integer 
from 161-171 (i.e., any of amino acids 161 to 171 of SEQ ID N0:4 is the C-terminus). 
The use of oligonucleotide-directed in vitro mutagenesis to construct an expression 
vector encoding a biologically active TNF-R protein having amino acid 161 

25 (asparagine) as the C-terminal amino acid is illustrated in example 7 of EP 422,339. 
Further, procedures for purifying naturally occurring soluble forms of both of the 
above-described TNT-R proteins (i.e., soluble forms of the SEQ ID N0:2 and the SEQ 
ID N0:4 proteins) from human urine have been described by Engelmann et al. (7. BioL 
Chem. 265:1531, 1990). 

30 Interleukdn-l receptors that may be employed as components of the inventive 

fusion proteins include pol>T)eptides designated herein as type I IL-IR and type 11 DL- 
IR. Type I IL-1 receptors have been detected on T-cells and certain other cell types, 
while expression of t>pe 11 IL-1 receptors on B cells has been reported. In the absence 
of any specific designation, the term "IL-1 receptor" as used herein refers collectively to 

35 type I and type II IL-1 receptors. 
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Among the IL-IR polypeptides that may be employed in the present invention 
are the type I IL-IR polypeptides described in U.S. patent applications serial no. 
07/821,716, filed January 14, 1992; and serial no. 455,488, filed December 21, 1989; 
and in European patent application publication no. 318,296; the disclosures of which 
5 are incorporated herein by reference, in their entireties. The DNA sequence of a cloned 
cDNA encoding a human type I IL-IR protein and the amino acid sequence encoded 
thereby are presented herein in SEQ ID NOS: 5 and 6. The protein contains 569 amino 
acids, of which 20 are an N-terminal signal peptide. The aspartic acid (Asp) residue at 
position 1 is the first amino acid of the mature protein. The transmembrane region 

10 includes amino acids 317 (His) through 336 (Tyr). The sequence of human IL-IR is 
also disclosed in Sims et al., Proc Natl Acad, Sci, USA 5(5:8946 (1989). 

As with the TNF-R polypeptides, soluble IL-IR polypeptides may be employed 
in the fusion proteins of the present invention. Soluble IL-IR polypeptides generally 
lack the transmembrane region and preferably lack the cytoplasmic domain as well. 

15 Soluble IL-iR proteins may also include part of the transmembrane region or the 
cytoplasmic domain, provided tiiat the soluble IL-IR protein is capable of being 
secreted from the cell. 

Examples of soluble type I IL-IR polypeptides include, but are not limited to, 
those comprising the amino acid sequence depicted as amino acids y-x of SEQ ID 

20 NO:6, wherein x is 312-316 and y is -3 to 3. In other words, the N-terminal amino 
acid is selected from the Leu, Glu, Ala, Asp, Lys, and Cys residues at positions -3, -2, 
-1, 1, 2, and 3, respectively. The C-terminal amino acid of the soluble protein is 
selected from the Thr, Asn, Phe, Gin, and Lys residues at positions 312, 313, 314, 
315, and 316, respectively. Preferred soluble IL-IR polypeptides include those 

25 comprising amino acids 1-x of SEQ ID N0:6, wherein x is 312-316. 

Any of the above-described soluble type I IL-IR polypeptides may additionally 
comprise a signal sequence, e.g., the signal sequence shown as amino acids -20 to -1 
in SEQ ID N0:6. Alternatively, a different signal sequence functional in an intended 
host cell (e.g., a yeast signal sequence) may be employed, as discussed below. 

30 Among the type n IL-IR polypeptides that may be employed in the present 

invention are those described in U.S. patent application serial no. 701,415, filed June 
16, 1991; and in European patent application publication no. 460,846; tiie disclosures 
of which are incorporated herein by reference, in their entireties. Native glycosylated 
human type n IL-IR proteins recovered from cell lysates generally have an apparent 

35 molecular weight of about 60-68 kilodaltons by SDS-PAGE. The DNA sequence of a 
cloned cDNA encoding a human type H IL-IR protein and the amino acid sequence 
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encoded thereby are presented herein in SEQ ID NOS:7 and 8. The protein comprises a 
13-amino acid signal peptide. The transmembrane region includes amino acids 331 
(Ala) through 356 (Met). 

Soluble type II IL-IR proteins are derived by deleting a C-ierminal portion of 
5 the protein that is not necessar}' for IL- 1 binding, so that the protein is secreted from the 
cell. The cysteine residue at position 313 is believed to be necessary to maintain the 
tertiar>' structure of the type 11 IL-IR molecule and permit binding of IL-1. Examples 
of soluble type n IL-IR polypeptides thus include those in which the C-terminal amino 
acid is selected from any of amino acids 314-333. In otiier words, the soluble LL-IR 

10 may contain the amino acid sequence shown as amino acids 1-x of SEQ ID NO: 8 
wherein x is an integer from 314-333. Preferred examples of suitable soluble type II 
IL-IR polj'peptides include, but are not limited to, those comprising amino acids 1-x of 
SEQ ED N0:8, wherein x is 330-333. In other words, the C-terminal amino acid of the 
soluble protein is selected from the Glu, Ala, Ser, and Ser residues at positions 330, 

15 331, 332, and 333, respectively. The soluble type II IL-IR polypeptides may 
additionally comprise a signal sequence, e.g., the signal sequence shown as amino 
acids -13 to -1 in SEQ ID NO: 8. Alternatively, a different signal sequence functional in 
an intended host cell (e.g., a yeast signal sequence) may be employed, as discussed 
below. 

20 Assay procedures described herein may be employed to confirm biological 

activity for additional soluble TNT-R and IL-IR polypeptides, beyond the particular 
examples set fonh above. Soluble TNF-R and soluble IL-IR may be identified (and 
distinguished from their non-soluble membrane-bound counterpans) by separating 
intact cells which express the desired protein from the culture medium, e.g., by 

25 centrifugation, and assaying the medium (supernatant) for tiie presence of the desired 
protein. The culture medium may be assayed using procedures which are similar or 
identical to tiiose described in the examples below. The presence of TNF-R and IL-IR 
in the medium indicates that the protein was secreted from the cells and thus is a soluble 
form of the desired protein. 

30 The N- or C- terminus of the TNF-R or IL-IR polypeptides may var>' according 

to such factors as the type of host cells employed when producing the fusion protein via 
recombinant DNA technology and the particular cells from which the protein is purified 
when non-recombinant TNF-R or IL-IR is employed. Such variations may be 
attributable to differential post-translational processing of the protein in various t\7>es of 

35 cells, for example. Variations in the N- or C-terminal sequence also may result from 
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the oligonucleotides chosen to reconstruct either terminus of the TNF-R or IL-IR 
encoding DNA sequence when constructing expression vectors. 

Differential processing may result in mature TNF-R or IL-IR proteins having 
an N-tenninal amino acid other than those shown at position 1 of SEQ ID N0S:2, 4, 6 
5 and 8. For example, in certain host cells, post-translational processing will remove the 
methionine residue encoded by an initiation codon, whereas the methionine residue will 
remain at the N-terminus of proteins produced in other host cells. Further, the N- and 
C-termini have been known to vary for the same protein, depending on the source of 
the protein. In some cases, the deletion of amino acids at either terminus of the protein 

10 may be due to proteolysis, occurring either intracellular^ or during purification. 
Varying N-temiini may also residt from cleavage of the signal peptide in certain host 
cells at a point other than between amino acids -1 and 1 of the disclosed sequences. 

As described in examples 10 and 11 and figure 31 of EP 422,339, a non- 
recombinant mature TNF-R protein purified from human urine lacked two N-terminal 

15 amino acids found in a non-recombinant TNF-R protein purified from the human 
monocyte-like cell line U937. The N-terminal amino acid of the urine-derived TNF-R 
was the alanine residue at position 3 of SEQ ID NO:l. Engeknann et al. (/. Biol. 
Chem. 265:1531, 1990) disclose a protein that binds TNF and was purified from 
human urine in forms truncated to varying degrees at the N-terminus (see the abstract 

20 and page 1533). The N-terminal amino acid sequence of one protein species was Val- 
Ala-Phe-Thr-Pro-, which corresponds to amino acids 5-9 of SEQ ID NO:L Other 
forms of the protein had either phenylalanine (amino acid 7 of SEQ ID NO:l) or 
threonine (amino acid 8 of SEQ ID NO: 1) as the N-terminal amino acid. 

The N- and C- termini of the TNF-R and IL-IR proteins may vary for reasons 

25 that include ±ose discussed above. The N-terminal amino acid may, for example, be 
any of the amino acids at positions 1 to 5 of SEQ ID NOS: 2 or 4 for TNF-R or of SEQ 
ID NOS:6 or 8 for IL-IR. The C-terminus may be truncated deliberately during 
expression vector construction (e.g., in constructiuig vectors encoding soluble proteins 
as described above) or as a result of differential processing which may remove up to 

30 about five C-terminal amino acids, for example. 

Additional TNF-R and IL-IR polypeptides that may be employed retain the 
desired biological activity but vary from the native sequences in tiiat amino acid(s) are 
added to, deleted from, or substituted in the native sequence. The biological activity of 
such proteins can be confirmed using the assays described herein. 

35 Derivatives of TNF-R and IL-IR that are witiiin tiie scope of the invention also 

include various structural forms of the primary protein which retain biological activity. 
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Due to the presence of ionizable amino and carboxyl groups, for example, a TNF-R or 
IL-IR protein may be in the form of acidic or basic salts, or may be in neutral form. 
Individual amino acid residues may also be modified by oxidation or reduction. 

The priman^ amino acid strucmre may be modified by forming covalent or 
aggregative conjugates with other chemical moieties, such as glycosyl groups, lipids, 
phosphate, acetyl groups and the like. Covalent derivatives are prepared by linking 
particular functional groups to amino acid side chains or at the N- or C-termini. 
Naturally occurring variants may result from alternative RNA splicing events. 

Other proteins that may be employed in the inventive fusion proteins include 
conjugates of TNF-R or IL-IR with other polypeptides^ which may be produced by 
synthesis in recombinant culture as N-terminal or C-terminal fusions. For example, the 
conjugated peptide may be a a signal (or leader) peptide sequence at the N-terminal 
region of the protein which co-translationally or post-translationally directs transfer of 
the protein from its site of synthesis to its site of function inside or outside of the cell 
15 membrane or wall (e.g., the yeast a-factor leader). The fusion proteins can comprise 
peptides added to facilitate purification or identification of the fusion protein. Such 
peptides include, for example, poly-His or the antigenic identification peptides 
described in U.S. Patent No. 5,011,912 and in Hopp et al., Bio/Technology 
6:1204,1988. One such peptide is the FLAG® peptide, Asp-Tyr-Lys-Asp-Asp-Asp- 
20 Asp-Lys (DYKDDDDK), which is highly antigenic and provides an epitope reversibly 
bound by a specific monoclonal antibody enabling rapid assay and facile purification of 
expressed recombinant protein. This sequence is also specifically cleaved by bovine 
mucosal enterokinase at tiie residue immediately following the Asp-Lys pairing. 
Fusion proteins capped with this peptide may also be resistant to intracellular 
25 degradation in £. colL A murine hybridoma designated 4E11 produces a monoclonal 
antibody tiiat binds the peptide DYKDDDDK in the presence of certain divalent metal 
cations (as described in U.S. Patent 5,011,912) and has been deposited with the 
American Type Culture Collection under accession no HB 9259. 

The inventive fusion proteins comprise TNF-R or IL-IR with or without 
30 associated narive-panem glycosylation. TNF-R and IL-IR expressed in yeast or 
mammalian expression systems, e.g., COS-7 cells, may be similar or slightly different 
in molecular weight and glycosylation pattern than the native molecules, depending 
upon the expression system. Expression of recombinant proteins in bacteria such as £. 
coli provides non-glycosylated proteins. Proteins having inactivated N-glycosylarion 
35 sites can be produced by oligonucleotide synthesis and ligation or by site-specific 
mutagenesis techniques. These mutant proteins can be produced in a homogeneous, 
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reduced-carbohydrate form in good yield using yeast expression systems. N- 
glycosylation sites in eukaryodc proteins are characterized by the amino acid triplet 
Asn-Ai-Z, where Ai is any amino acid except Pro, and Z is Ser or Thr. In this 
sequence, asparagine provides a side chain amino group for covalent attachment of 
5 carbohydrate. Such a site can be eliminated by substituting another amino acid for Asn 
or for residue Z, deleting Asn or Z, or inserting a non-Z amino acid between Ai and Z, 
or an amino acid other than Asn between Asn and Ai. Known procedures for 
inactivating N-glycosylation sites in proteins include those described in U.S. Patent 
5,071,972 and EP 276,846. Examples of N-glycosylation sites in human type n IL- 
10 IR are amino acids 53-55, 59-61, 99-101, 206-208, and 264-266 in SEQ ID NO:8. 
Potential N-glycosylarion sites are found in the type I IL-IR protein (SEQ ID NO:6) at 
amino acids 80-82, 173-175, 213-215, 229-231, 243-245, and 277-279. N- 
glycosylation sites are found at amino acid 171-173 and 358-360 of TNF-R in Figures 
2A-2B. 

15 Cysteine residues that are not essential for biological activity can be deleted or 

replaced with other amino acids to prevent formation of unnecessary or incorrect 
intramolecular disulfide bridges upon renaturation. The cysteine residue at position 
178 in the Figures 2A-2B TNF-R sequence can be deleted, for example. U.S. Patent 
4,518,584 describes the use of site directed mutagenesis to delete or replace cysteine 

20 residues within a protein. Other approaches to mutagenesis involve modification of 
adjacent dibasic amino acid residues to enhance expression in yeast systems in which 
KEX2 protease activity is present. EP 212,914 discloses the use of site-specific 
mutagenesis to inactivate KEX2 protease processing sites in a protein. In order to 
preserve the biological activity of TNF-R and IL-IR, substimtions will preferably result 

25 in homologous or conservatively substituted sequences, meaning that a given amino 
acid residue is replaced by a residue having similar physiochemical characteristics. 
Examples of conservative substitutions include substitution of one aliphatic residue for 
another, such as De, Val, Leu, or Ala for one another, or substitutions of one polar 
residue for another, such as between Lys and Arg; Glu and Asp; or Gin and Asn. 

30 Otiier such conservative substitutions, for example, substitutions of entire regions 
having similar hydrophobicity characteristics, are well known. Moreover, particular 
amino acid differences between human, murine and other mammalian TNF-Rs is 
suggestive of additional conservative substitutions that may be made without altering 
the essential biological characteristics of TNF-R or IL-IR. 



number of known techniques. Mutations can be introduced at particular loci by 



35 



Alterations of the native amino acid sequence may be accomplished by any of a 
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synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites 
enabling ligation to fragments of the native sequence. Following ligation, the resulting 
reconstructed sequence encodes an analog having the desired amino acid insertion, 
substitution, or deletion. 
5 Altematively, oligonucleotide -directed site-specific mutagenesis procedures can 

be employed to provide an altered gene having particular codons altered according to 
the substitution, deletion, or insertion required. Exemplar)' methods of making the 
alterations set fonh above are disclosed by Walder et al. (Gene 42:133, 1986); Bauer et 
al. (Gene 57:73, 1985); Craik {BioTechniques, January 1985, 12-19); Smith et al. 
10 (Genetic Engineering: Principles and Methods, Plenum Press, 1981 ); and U.S. Patent 
Nos. 4,518,584 and 4.737,462 disclose suitable techniques, and are incorporated by 
reference herein. 

The variant amino acid sequence preferably is at least 80% identical, most 
preferably at least 90% identical, to the native sequence. Percent similarity may be 
1 5 determined, for example, by comparing sequence infonnation using the GAP computer 
program, version 6.0, available from the University of Wisconsin Genetics Computer 
Group GJWGCG). The GAP program utilizes the alignment method of Needleman 
and Wunsch (J. Mol. Biol. 48:443. 1970), as revised by Smith and Waterman (Adv. 
Appl. Math. 2:482, 1981). Briefly, the GAP program defines similaritv' as the number 
20 of aligned symbols (i.e., nucleotides or amino acids) which are similar, divided by the 
total number of symbols in the shorter of the two sequences. The preferred default 
parameters for the GAP program include: (1) a unary comparison matrix (containing a 
value of 1 for identities and 0 for non-identities) for nucleotides, and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids Res. 74:6745, 1986, as 
described by Schwartz and Dayhoff, eds., Adas of Protein Sequence and Structure, 
National Biomedical Research Foundation, pp. 353-358, 1979; (2) a penalty of 3.0 for 
each gap and an additional 0.10 penalt>' for each sv-mbol in each gap; and (3) no penalty 
for end gaps. 

Other proteins capable of blocking the binding of IL-1 to cellular receptors in 
vivo may be substituted for the IL-IR polypeptides described above in the fusion 
proteins of the present invention. Such proteins, generally referred to as IL-1 receptor 
antagonists, include those described by Eisenberg et al. (Nature 343:341, 1990), 
Hannum et al. (Nature 343:336, 1990), and Caner et al. (Nature 344:633, 1990), 
which are hereby incorporated by reference in their entireties. These antagonist 
35 proteins bind to IL-1 receptors, but have no IL-1 -like activity (e.g., do not transduce a 
signal or otherwise produce the biological effects that result from binding of IL-1 to a 



25 



30 
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cellular IL-1 receptor). The antagonist proteins compete with IL-1 for binding to 
endogenous IL-1 receptors, thus inhibiting biological effects mediated by IL-1 in vivo, 

DNA Sequences Encoding Recombinant Fusion Proteins 
5 Isolated DNA sequences encoding the above-described fusion proteins are also 

provided by the present invention. A DNA sequence encoding a fusion protein of the 
present invention is constructed using recombinant DNA techniques to insert DNA 
fragments encoding the IL-IR or TNF-R polypeptides into an appropriate expression 
vector. The 3' end of a DNA fragment encoding TNF-R is ligated {via a peptide linker) 

10 to the 5' end of the DNA fragment encoding IL-IR with the reading frames of the 
sequences in phase to permit transladon of the mRNA into a single biologically acdve 
fusion protein. Alternatively, the 3' end of a DNA fragment encoding IL-IR may be 
ligated {via a peptide linker) to the 5' end of the DNA fragment encoding TNF-R, with 
the reading frames of the sequences in phase to permit translation of the niRNA into a 

15 single biologically active fusion protein. An additional sequence encoding TNF-R may 
be ligated in the same reading frame to produce a sequence encoding a fusion protein 
comprising two TNF-R polypeptides and one IL-IR polypeptide. The IL-lR-encoding 
sequence is preferably positioned upstream of the TNF-R-encoding sequence(s). 
While the fusion protein may comprise two IL-IR polypeptides along with the TNF-R 

20 polypepride(s), one IL-IR is preferred. A single IL-IR provides the desu-ed high 
affinity IL-1 binding acrivitj' without the possible disadvantages of increasing the size 
of the fusion protein by adding a second IL-IR. Such disadvantages may include 
increased complexity of vector construction procedures and possible reduction in the 
level of expression of the desired protein. In another embodiment of the present 

25 invention, a DNA sequence encoding TNF-R is ligated to a linker sequence which in 
turn is ligated to a second TNF-R encoding sequence. 

A DNA sequence encoding an N-tenninal signal sequence may be retained on 
the DNA sequence encoding the N-terminal polypeptide, while stop codons, which 
would prevent read-through to the downstream DNA sequence(s), are eliminated. 

30 Conversely, a stop codon required to end translation is generally retained on the DNA 
sequence encoding the C-terminal polypeptide. DNA encoding a signal sequence is 
preferably removed from DNA sequences other tiian tiiose encoding the N-terminal 
polypeptide. 

A DNA sequence encoding a desired peptide linker may be insened between, 
35 and in the same reading frame as, the DNA sequences encoding TNF-R or IL-IR using 
any suitable conventional technique. For example, a chemically synthesized 
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oligonucleotide encoding the linker and containing appropriate restriction endonuclease 
cleavage sites may be ligated between the sequences encoding TNF-R or IL-IR. 
Alternatively, a chemically synthesized DNA sequence may contain a sequence 
complementary to the 3' terminus (without the stop codon) of either TNF-R or IL-IR 
followed by a linker-encoding sequence which is followed by a sequence 
complememar\' to the 5' terminus of the other of TNF-R and IL-IR. Oligonucleotide 
directed mutagenesis is then employed to insen the linker-encoding sequence into a 
vector containing a direct fusion of TNF-R and EL-IR. Another technique employs 
polymerase chain reactions using primers comprising, in pan. single strand segments 
encoding a peptide linker. PCR-generated DNA fragments encoding two different 
proteins can be joined through annealing of the complementar\' single stranded linker- 
encoding segments present at a terminus of each fragment. Preferred procedures for 
inserting a linker-encoding DNA segment between TNF-R and IL-IR DNA sequences 
(or between two TNF-R DNA sequences) are described in examples 1 1 and 12 below. 
15 DNA sequences encoding TNF-R and EL-IR may be isolated by any suitable 

conventional procedure, for use in constructing the fusion protein-encoding DNA 
sequences of the present invention. DNA sequences encoding fusion proteins to be 
expressed in a microorganism will preferably contain no introns that could prematurely 
terminate transcription of DNA into mRNA; however, premature termination of 
20 transcription may be desirable, for example, where it would result in mutants having 
advantageous C-terminal truncations, for example, deletion of a transmembrane region 
to yield a soluble receptor not bound to the cell membrane. Among the suitable 
procedures for cloning type I and type H IL-IR cDNA are tiiose presented in examples 
8 and 9, respectively. TNF-R cDNA may be isolated by procedures that include those 
25 described in example 3. 

The coding sequence of TNF-R may be obtained by isolating a sequence 
encoding TNF-R from a recombinant cDNA or genomic DNA library. A cDNA libraiy 
is preferably constructed by obtaining poiyadenylated mRNA from a particular cell line 
which expresses a mammalian TNF-R, for example, the human fibroblast cell line WI- 
30 26 VA4 (ATCC CCL 95.1) and using die mRNA as a template for syntiiesizing double 
stranded cDNA. The double stranded cDNA is then packaged into a recombinant 
vector, which is introduced into a host cell (e.g., an appropriate £. coli strain) and 
propagated. TNF-R sequences contained in tiie cDNA library can be identified by 
screening the Iibrar>^ with an appropriate nucleic acid probe which is capable of 
35 hybridizing with human TNF-R cDNA. Anotiier cloning technique tiiat may be 
employed is the direct expression procedure described in example 3 below. 
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Alternatively, DNAs encoding TNF-R proteins can be assembled by ligation of 
synthetic oligonucleotide subunits corresponding to aU or part of the sequence of 
Figures 2A-2B to provide a complete coding sequence. 

Additional cDNA clones can be isolated from cDNA libraries of other 
5 mammalian species by cross-species hybridization. For use in hybridization, DNA 
encoding TNF-R or IL-IR may be covalently labeled with a detectable substance such 
as a fluorescent group, a radioactive atom or a chemiluminescent group by methods 
well known to those skilled in the pertinent art. 

Like most mammalian genes, mammalian TNF receptors and IL-1 receptors are 

10 presumably encoded by multi-exon genes. Alternative mRNA constructs which can be 
attributed to different mRNA splicing events following transcription, and which share 
large regions of identity or similarity with the cDNAs disclosed herein such that 
biologically active TNF-R or IL-IR is encoded thereby, are considered to be useful in 
preparing the fusion proteins of the present invention. 

15 DNA encoding soluble TNF-R and IL-IR polypeptides may be prepared by any 

of a number of conventional techniques. A DNA fragment encoding a desired soluble 
polypeptide may be subcloned into an expression vector. DNA fragments may be 
produced by restriction endonuclease digestion of a full length cloned DNA sequence, 
and isolated by electrophoresis on agarose gels. Alternatively, a desired DNA sequence 

20 may be chemically synthesized using known techniques. Linkers containing restriction 
endonuclease cleavage site(s) may be employed to insert the desired DNA fragment into 
an expression vector, or the fragment may be digested at cleavage sites naturally present 
therein. 

The well known polymerase chain reaction (PGR) procedures also may be 
25 employed to isolate a DNA sequence encoding a desired soluble protein fragment. This 
technique is illustrated in the examples below. 

In another approach, enzymatic treatment (using Bal 31 exonuclease) may be 
employed to delete terminal nucleotides from a DNA fragment to obtain a fragment 
having a particular desired terminus. Among tiie commercially available linkers are 
30 those that an be ligated to the blunt ends produced by Bal 31 digestion, and which 
contain restriction endonuclease cleavage site(s). Alternatively, oligonucleotides that 
reconstruct the N- or C- terminus of a DNA fragment to a desired point may be 
synthesized. The oligonucleotide may contain a restriction endonuclease cleavage site 
upstream of the desired coding sequence and position an initiation codon (ATG) at the 
35 N-terminus of the coding sequence. 
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The TNF-R and IL-IR DNA sequences may vaT>' from those presented in SEQ 
ID N0S:1, 3, 5 and 7. Due to the known degeneracy of the genetic code, there can be 
considerable variation in nucleotide sequences encoding the same amino acid sequence, 
for example. DNA sequences capable of hybridizing to the DNA sequences of SEQ ID 
5 N0S:1, 3, 5 and 7 under moderately stringent conditions (55'C, 5 X SSC), and which 
encode a biologically active TNT-R or IL-IR polypeptide, are also considered to be 
TNT-R-encoding or IL-lR-encoding DNA sequences, respectively, in the context of 
the present invention. Mutations may be deliberately made to the native DNA 

sequences, e.g. to produce the amino acid substitutions, deletions and insertions 

10 described above. Cenain of the mutations will not be expressed in the final protein 
product. For example, nucleotide substitutions may be made to enhance expression, 
primarily to avoid secondary structure loops in the transcribed mRNA (see EP 75,444, 
incorporated herein by reference). Other alterations of the nucleotide sequence may be 
made to provide codons that are more readily translated by the selected host, e.g., the 

15 well-known E, coli preference codons for E. coli expression. Silent mutations 
(changes in the DNA sequence that do not alter the encoded amino acid sequence) also 
may occur during polymerase chain reactions. In one embodiment of the present 
invention, nucleotide number 437 of SEQ ID N0:5 (type I IL-IR) is changed from a T 
to a C. This silent mutation occurred during a polymerase chain reaction. 

20 Mutations in nucleotide sequences should, of course, preserve the reading 

frame phase of the coding sequences. The mutations preferably will not create 
complementary regions that could hybridize to produce secondary mRNA structures 
such as loops or hairpins which would adversely affect translation of the fusion protein 
mRNA. 

25 The present invention thus provides inventive DNA sequences encoding the 

above-described fusion proteins, wherein each TNF-R DNA sequence in the fusion 
protein DNA sequence is selected from: (a) DNA sequences derived from the coding 
region of a native mammalian TNF-R gene (e.g., cDNA derived from the coding region 
of SEQ ID N0S:1 or 3); (b) DNA sequences capable of hybridization to a DNA 

30 sequence of (a) under moderately stringent conditions (50'C, 2x SSC) and which 
encode biologically active TNF-R and (c) DNA sequences which are degenerate as a 
result of the generic code to tiie DNA sequences defined in (a) or (b) and which encode 
biologically active TNT-R. The fusion protein DNA sequence likewise may comprise 
IL-IR encoding DNA sequence(s) selected from: (a) DNA sequences derived from the 

35 coding region of a native mammalian IL-IR gene (e.g., cDNA derived from the coding 
region of SEQ ID NOS:5 or 7); (b) DNA sequences capable of hybridization to a DNA 
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sequence of (a) under moderately stringent conditions (55*C, 5x SSC) and which 
encode biologically active IL-IR; and (c) DNA sequences which are degenerate as a 
result of the genetic code to the DNA sequences defined in (a) or (b) and which encode 
biologically active EL-IR. 

5 

Expression of Recombinant Fusion Proteins 

The present invention provides recombmant expression vectors to express 
DNA encoding the fusion proteins of the present invention. The inventive recombinant 
expression vectors are replicable DNA constructs which contain a synthetic or cDNA- 

10 derived DNA sequence encoding one of the above-described fusion proteins, operably 
linked to suitable transcriptional or translational regulatory elements. Examples of 
genetic elements having a regulatory role in gene expression include transcriptional 
promoters, operators or enhancers, a sequence encoding suitable mRNA ribosomal 
binding sites, and appropriate transcription and translation initiation and termination 

15 sequences. The ability to replicate in a host, usually conferred by an origin of 
replication, and a selection gene to facilitate recognition of transformants may 
additionaUy be incorporated. The regulatory elements employed in the expression 
vectors are generally derived from mammalian, microbial, viral, or insect genes. 
Expression vectors derived from retroviruses also may be employed. 

20 DNA regions are operably linked when they are functionally related to each 

otiier. A DNA sequence encoding a fusion protein is said to be operably linked to one 
or more of the above-described regulatory elements when die fusion protein DNA 
sequence is transcribed, or the resulting mRNA is translated, under tiie control of the 
regulatory element(s). 

Transformed host cells are cells which have been transformed or transfected 
with foreign DNA using recombinant DNA techniques. In die context of the present 
invention, tiie foreign DNA includes a sequence encoding the inventive fusion protein. 
Host cells may be transformed for purposes of cloning or amplifying the foreign DNA, 
or may be transformed witii an expression vector for production of tiie fusion protein 

30 under the control of appropriate promoters. Suitable host cells include prokaryotes, 
yeast, or higher eukaryotic cells. Appropriate cloning and expression vectors for use 
with bacterial, fungal, yeast, and mammalian cellular hosts are described by Pouwels et 
al. {Cloning Vectors: A Laboratory Manual Elsevier, New York, 1985), the relevant 
disclosures of which is hereby incorporated by reference. CeU-free translation systems 

35 could also be employed to produce fusion protein using RNAs derived from die DNA 
constructs of the present invention. 
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Prokaryotes include gram negative or gram positive organisms. Prokaryotic 
expression vectors generally comprise one or more phenotypic selectable markers, for 
example a gene encoding proteins conferring antibiotic resistance or supphing an 
autotrophic requirement, and an origin of replication recognized by the host to ensure 
5 amplification within the host. Examples of suitable prokaryotic hosts for 
transformation include E. coli, bacilli such as Bacillus subtilis, Salmonella 
xyphimurium, and various species wthin the genera Pseudomonas, Streptomyces, and 
Staphylococcus, although others may also be employed as a matter of choice. 

Useful expression vectors for bacterial use can comprise a selectable marker and 
1 0 bacterial origin of replication derived from commercially available plasmids comprising 
generic elements of the well-known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, 
Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the 
15 structural sequence to be expressed. £. coli is typically transformed using derivatives 
of pBR322, a plasmid derived from an E. coli species CBolivar et al.. Gene 2:95, 
1977). pBR322 contains genes for ampicillin and tetracycline resistance, providing 
simple means for identifying transformed cells. 

Promoters commonly used in recombinant microbial expression vectors include 
20 the b-lactamase (penicillinase) and lactose promoter system (Chang et al.. Nature 
275:615, 1978; and Goeddel et al. Nature 257:544, 1979), the tr>'ptophan (trp) 
promoter system (Goeddel et al„ NucL Acids Res. S:4057, 1980; and EPA 36,776) 
and tac promoter (Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laborator>\ p. 412, 1982). A particularly useful bacterial expression system 
25 employs the phage X Pl promoter and cI857ts thermoinducible repressor. Plasmid 
vectors available from the American Type Culture Collection which incorporate 
derivatives of the ?l Pl promoter include plasmid pHUB2, resident in £. coli strain 
JMB9 (ATCC 37092) and pPLc28, resident in E. coli RRl (ATCC 53082). 

The recombinant fusion protein may also be expressed in yeast hosts, 
30 preferably from Saccharomyces species, such as 5. cerevisiae. Yeast of other genera 
such as Pichia or Kluyveromyces may also be employed. Yeast vectors will generally 
contain an origin of replication from the 2um yeast plasmid or an autonomously 
replicating sequence (ARS), a promoter, DNA encoding the fusion protein, sequences 
for polyadenylation and transcription termination and a selection gene. Preferably, 
35 yeast vectors will include an origin of replication and selectable markers permitting 
transformation of both yeast and £. coli, e.g., the ampicillin resistance gene of E. coli 
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and the S. cerevisiae trpl gene, which provides a selection marker for a mutant strain of 
yeast lacking the ability to grow in tryptophan, and a promoter derived from a highly 
expressed yeast gene to induce transcription of a stmctural sequence downstream. The 
presence of the trpl lesion in the yeast host cell genome then provides an effective 
5 environment for detecting transfomiarion by growth in the absence of tryptophan. 

Suitable promoter sequences in yeast vectors include the promoters for 
metallothionein, 3-phosphoglycerate kinase (Hitzeman et al., J. Biol. Chem, 255:2073, 
1980) or other glycolytic enzymes (Hess et al., J. Adv. Enzyme Reg, 7:149, 1968; and 
Holland et al., Biochem, i7:4900, 1978), such as enolase, glyceraldehyde-3-phosphate 
10 dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase and glucokinase. Suitable vectors and 
promoters for use in yeast expression are further described in R. Hitzeman et al., EPA 
73,657. 

15 Prefeired yeast vectors can be assembled using DNA sequences from pBR322 

for selection and replication in £. coli (Amp^ gene and origin of replication) and yeast 
DNA sequences including a glucose-repressible ADH2 promoter and a-factor secretion 
leader. The ADH2 promoter has been described by Russell et al. (/. BioL Chem. 
258:161 A, 1982) and Beier et al., {Nature 300:724, 1982). Advantageously, a DNA 

20 segment encoding a leader sequence functional in yeast is operably linked to the 5' end 
of the DNA encoding the fusion protein. The encoded leader peptide promotes 
secretion of the fusion protein from the host cell and is generally cleaved from the 
fusion protein upon secretion. As one example, the yeast a-factor leader, which directs 
secretion of heterologous proteins, can be inserted between the promoter and the 

25 structural gene to be expressed. See, e.g., Kuijan et al., Cell 30:922, 1982; and Bitter 
et al., Proc. Natl Acad. Sci. USA Si:5330, 1984. The leader sequence may be 
modified to contain, near its 3' end, one or more useful restriction sites to facilitate 
fusion of the leader sequence to foreign genes. 

Suitable yeast transformation protocols are known to those of skill in the art. 

30 An exemplary technique is described by Hinnen et al., Proc. Natl. Acad. Sci. USA 
75:1929, (1978), selecting for Tip"*" transfomiants in a selective medium consisting of 
0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, lO^g/ml adenine and 20 
M-g/ml uracil. Host strains transformed by vectors comprising the above-described 
ADH2 promoter may be grown for expression in a rich medium consisting of 1% yeast 

35 extract, 2% peptone, and 1% glucose supplemented with 80 |ig/ml adenine and 80 
lig/ml uracil. Derepression of die ADH2 promoter occurs upon exhaustion of medium 
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glucose. Crude yeast supematants are harv^ested by filtradon and held at 4''C prior to 
further purificadon. 

Various maminalian or insect cell culture systems can be employed to express 
recombinant protein. Baculovirus systems for producdon of heterologous proteins in 
5 insect cells are reviewed by Luckow and Summers, Bio/Technology 6:41 (1988). 
Established cell hnes of mammalian origin may be employed. Examples of suitable 
mammalian host cell lines include the COS-7 lines of monkey kidney cells, described 
by Gluzman {Cell 23:115, 1981), L cells, C127, 3T3, Chinese hamster ovar>' (CHO), 
HeLa and BHK cell lines. Mammalian expression vectors may comprise non- 
10 transcribed elements such as an origin of replication, a suitable promoter and enhancer 
linked to the gene to be expressed, and other 5' or 3' flanking nontranscribed 
sequences, and 5' or 3' nontranslated sequences, such as necessar\^ ribosome binding 
sites, a poly-adenylation site, splice donor and acceptor sites, and transcriptional 
termination sequences. 

15 The transcriptional and translauonal control sequences in expression vectors to 

be used in transforming vertebrate cells may be provided by viral sources. For 
example, commonly used promoters and enhancers are derived from Polyoma, 
Adenovirus 2, Simian Virus 40 (SV40), and human cytomegalovirus. DNA sequences 
derived from the SV40 viral genome, for example, SV40 origin, early and late 

20 promoter, enhancer, splice, and polyadenylation sites may be used to provide the other 
genetic elements required for expression of a heterologous DNA sequence. The early 
and late promoters are particularly useful because both are obtained easily from the 
virus as a fragment which also contains the SV40 viral origin or replication (Fiers et al., 
Nature 275:1 13, 1978). SmaUer or larger SV40 fragments may also be used, provided 

25 the approximately 250 bp sequence extending from the Hind HI site toward the fi^H 
site located in the viral origin of replication is included. Exemplar>^ vectors can be 
constructed as disclosed by Okayama and Berg {MoL Cell. Biol i:280, 1983). A 
useful system for stable high level expression of mammalian receptor cDNAs in C127 
murine mammary epithelial cells can be constructed substantially as described by 

30 Cosman et al. {MoL Immunol 23:935, 1986). 



Producing and Purifying the Fusion Protein 

The present invention provides a process for producing the recombinant fusion 
protein of the present invention, comprising culturing a host cell transformed with an 
35 expression vector comprising a DNA sequence that encodes said fusion protein under 
conditions that promote expression of the fusion protein, which is tiien purified from 
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culture media or cell extracts. Any suitable purification process may be employed, with 
the procedure of choice varying according to such factors as the type of host cells and 
whether or not the desired protein is secreted from the host cells. The fusion protein 
will be secreted into the culture medium when it is initially fused to a signal sequence or 
5 leader peptide operative in the host cells, or when the protein comprises soluble forms 
of the TNF-R and IL-IR polypeptides. 

For example, supematants from expression systems which secrete recombinant 
protein into the culture medium can be first concentrated using a commercially available 
protein concentration filter, for example, an Amicon or Millipore Pellicon ultrafiltration 

10 unit. Following the concentration step, the concentrate can be applied to a suitable 
purification matrix. For example, a suitable affinity matrix can comprise TNF or IL-L 
An affmity matrix may be prepared by coupling recombinant human TNF or IL-1 to 
cyanogen bromide-activated Sepharose (Pharmacia) or Hydrazide Afngel (Biorad), 
according to manufacturer's recommendations. A preferred piuification procedure 

15 involves sequential immunopurification using antibodies bound to a suitable support. 
Proteins binding to an antibody specific for TNF-R are recovered and contacted with 
antibody specific for IL-IR on an insoluble support. Proteins immunoreactive with 
both antibodies may thus be identified and isolated. A monoclonal antibody specific for 
human type I IL-IR was deposited with the American Type Culture Collection under 

20 accession number HB10556 on September 13, 1990. Alternatively, an anion exchange 
resin can be employed, for example, a matrix or substrate having pendant 
dietiiylaminoethyl (DEAE) groups. The matrices can be acrylamide, agarose, dextran, 
cellulose or other types commonly employed in protein purification. Alternatively, a 
cation exchange step can be employed. Suitable cation exchangers include various 

25 insoluble matrices comprising sulfopropyl or carboxymethyl groups. Sulfopropyl 
groups are preferred. One or more reversed-phase high performance liquid 
chromatography (RP-HPLC) steps employing hydrophobic RP-HPLC media, e.g., 
silica gel having pendant methyl or other aliphatic groups, can be employed to further 
purify a fusion protein composition. 

30 Recombinant protein produced in bacterial culture is usually isolated by initial 

extraction from cell pellets, followed by one or more concentration, salting-out, 
aqueous ion exchange or size exclusion chromatography steps. Finally, high 
performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of recombinant fusion proteins can 

35 disrupted by any convenient method, including freeze-thaw cycling, sonication, 
mechanical disruption, or use of cell lysing agents. 
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Fermentation of yeast which express fusion proteins as a secreted protein 
greatly simplifies purification. Secreted recombinant protein resulting from a large- 
scale fermentation can be purified by methods analogous to those disclosed by Urdal et 
al. (J, Chromaiog, 296:171, 1984), involving two sequential, reversed-phase HPLC 
5 steps for purification of a recombinant protein on a preparative HPLC column. 

Some or all of the foregoing puritlcation steps, in various combinations, can be 
employed to provide an essentially homogeneous recombinant protein. Recombinant 
cell culture enables the production of the fusion protein free of those contaminating 
proteins which may be normally associated with TNF-R or IL-IR as they are found in 

10 nature in their respective species of origin, e.g., in cells, cell exudates or body fluids. 
The foregoing purification procedures are among those that may be employed to purify 
non-recombinant receptors of the present invention as well. 

As an alternative to production of the inventive receptors as fusion proteins, the 
TNF-R and IL-IR proteins may be separately produced and purified, and subsequently 

15 linked together. Numerous reagents useful for crosslinking one protein molecule to 
another are known. Heterobifunctional and homobifunctional linkers are available for 
this purpose from Pierce Chemical Company, Rockford, Illinois, for example. Such 
linkers contain two functional groups (e.g., esters and/or maleimides) that will react 
with cenain functional groups on amino acid side chains (e.g., amines on lysine 

20 residues and sulfhydrj'ls generated on cysteine residues by reduction), thus linking one 
polypeptide to another. Examples of such crosslinking reagents are N- 
maleimidobenzoyl succinimidyl ester and N-hydroxysuccinimide. The reagent and 
reaction conditions should be chosen such that the cross-linking does not interfere with 
binding of TNR or IL-1 to the receptor. The TNT-R and IL-IR polypeptides are 

25 preferably linked via one of the above-described peptide linkers that functions as a 
spacer. A peptide linker may be attached to TNF-R or to IL-IR by any of the 
conventional procedures used to attach one polypeptide to another. The cross-linking 
reagents available from Pierce Chemical Company as described above are among those 
that may be employed. Amino acids having side chains reactive with such reagents 

30 may be included in the peptide linker, e.g., at the termini thereof 



Pharmaceutical Compositions 

The present invention provides pharmaceutical compositions comprising any of 
the above-described fusion proteins and a physiologically acceptable carrier, diluent, or 
35 excipient. Such carriers, excipients and diluents will be nontoxic to recipients at the 
dosages and concentrations employed. Such compositions may comprise buffers, 
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antioxidants such as ascorbic acid, low molecular weight (less than about 10 residues) 
polypeptides, proteins, amino acids, carbohydrates including glucose, sucrose or 
dextrins, chelating agents such as EDTA, glutathione and other stabilizers and 
excipients. Neutral buffered saline or saline mixed with conspecific serum albumin are 
5 exemplary appropriate diluents. Preferably, the composition is formulated as a 
lyophilizate using appropriate excipient solutions (e.g., sucrose) as diluents. 
Appropriate dosages can be determined in clinical trials. The amount and frequency of 
administration will depend, of course, on such factors as the nature and severity of the 
indication being treated, the desired response, the condition of die patient, and so forth. 

10 Conditions mediated by either TNF or IL-1 may be treated by administering a 

therapeutically effective amount of a fusion protein of the present invention in the form 
of a pharmaceutical composition, to a patient afflicted with such a disorder. A disorder 
is said to be mediated by TNF or IL-1 when TNF or IL-1 causes (direcdy or indirectiy) 
or exacerbates the disorder. Soluble receptor proteins can be used to competitively bind 

15 to TNF or IL-1, thereby inhibiting binding of TNF and IL-1 to cell surface receptors. 

For therapeutic use, purified fusion proteins of the present invention are 
administered to a patient, preferably a human, for treatment in a manner appropriate to 
the indication. Thus, for example, the pharmaceutical compositions can be 
administered by bolus injection, continuous infusion, sustained release from implants, 

20 or other suitable technique. 

The fusion protein employed in the pharmaceutical compositions should be 
purified, in tiiat tiie fusion protein is substantially free of otiier proteins of natural or 
endogenous origin and contains less than about 1% by mass of protein contaminants 
residual of production processes. Such compositions, however, can contain other 

25 proteins added as stabilizers, carriers, excipients or co-therapeutics. The fusion protein 
is purified to substantial homogeneity if it is detectable as a single protein band in a 
polyacrylamide gel by silver staining. 

The fusion proteins of tiie present invention may be administered to treat 
conditions believed to be mediated, at least in part, by TNF, such as cachexia, 

30 rheumatoid artiiritis, diabetes, multiple sclerosis, puhnonary fibrosis and silicosis, 
cerebral malaria, and allograft and xenograft rejection in graft versus host disease. 
TNF has also been implicated in sepsis and septic shock. Bacterial endotoxin can cause 
sepsis in mammals infected witii certain types of bacteria, and is beUeved to stimulate 
macrophages to produce factors that include TNF. Folks et al. {PNAS USA 56:2365, 

35 1989) suggests that TNF-a plays an important role in the pathogenesis of HIV 
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infection. TNF-a induced expression of HIY in a cell line employed as a model of HPv^ 
latency to study conversion from a latent to a productive infection. 

Cenain cytokines (TL-1, IL-2 and other colony stimulating factors) can induce 
significant host production of TNF. Fusion proteins of the formula TNF-R-linker- 
5 TNF-R thus may be used to treat side effects associated with cytokine therapy. 
Because of the primar\' role IL-1 plays in the production of TNF, fusion proteins 
comprising both IL-1 receptor(s) and TNF receptor(s) may be preferred in the treatment 
of TNF-associated clinical indications. 



10 proteins that bind both TNT and IL-1 may be employed in treating conditions mediated 
by IL-1. The inventive fusion proteins can be administered, for example, for the 
purpose of suppressing immune responses in a human. A variety of diseases or 
conditions are caused by an immune response to alloanrigen. In alloantigen-induced 
immune responses, IL-IR suppresses lymphoproliferation and inflammation which 

15 result upon activation of T cells. EL-IR thus may be used to suppress alloantigen- 
induced immune responses in the clinical treatment of, for example, rejection of 
allografts (such as skin, kidney, and heart transplants), and graft-versus-host reactions 
in patients who have received bone marrow transplants. EL-l is believed to play a 
causative role in allergies and autoimmune dysfunctions (such as rheumatoid arthritis, 

20 diabetes, and multiple sclerosis, which are dependent upon the activation of T cells 
against antigens not recognized as being indigenous to the host. 

TNF and IL-1 have been implicated in a number of the same diseases, as can be 
seen by comparing the lists of TNF-mediated and IL-l-mediated conditions presented 
above. In addition, TNF and EL-l are two of the major mediators of inflammation, 

25 often acting in concert. The fusion proteins of the present invention that comprise 
receptors for both TNTF and IL-1 thus offer advantages in the treatment of a number of 
conditions in which both TNF and IL-1 are beheved to play a causative role. 

The use of soluble forms of IL-1 R and TNF-R in the inventive fusion proteins 
is advantageous for certain applications. Purification of the proteins from recombinant 

30 host cells is facilitated, since the soluble proteins are secreted from the cells. Further, 
soluble proteins are generally more suitable for intravenous administration and may 
exen their therapeutic effect (binding IL-1 and/or TNT) in the bloodstream. By binding 
IL-1 and/or TNT, the soluble fusion proteins will inhibit signal transduction via 
endogenous cell surface receptors forlL-l or TNF. 



TNT has been reponed to induce secretion of EL-l in vivo. Thus, fusion 



wo 93/19777 PCr/US93/02938 




The inventive fusion proteins may also be used as reagents in receptor-based 
immunoassays, reagents in assays for TNF or IL-1, or as binding agents for affinity 
purification or TNF or IL-l. 

The following examples are offered by way of illustration, and not by way of 
5 limitation. 



EXAMPLES 

10 Example 1 

TNF Binding Assays 

A. Radiolabeling of TNF a and TNFp, Recombmant human TNFoc, in the form 
of a fusion protein containing a hydrophilic octapeptide at the N-terminus, was 

15 expressed in yeast as a secreted protein and purified by affinity chromatography (Hopp 
et Bio /Technology 6:1204, 1988). Purified recombinant human TNFp was 
purchased fi-om R&D Systems (Minneapolis, MN). Both proteins were radiolabeled 
using the commercially available solid phase agent, lODO-GEN (Pierce). In this 
procedure, 5 ^ig of lODO-GEN were plated at the bottom of a 10 x 75 mm glass tube 

20 and incubated for 20 minutes at 4'*C with 75 ^ of 0.1 M sodium phosphate, pH 7.4 
and 20 ^1 (2 mCi) Na I25i^ jhig solution was then transferred to a second glass tube 
containing 5 jig TNFa (or TNFp) in 45 \i\ PBS for 20 minutes at 4'C. The reaction 
mixture was fractionated by gel filtration on a 2 ml bed volume of Sephadex G-25 
(Sigma) equilibrated in Roswell Park Memorial Institute (RPMI) 1640 medium 

25 containing 2.5% (w/v) bovine serum albumin (BS A), 0.2% (w/v) sodium azide and 20 
mM Hepes pH 7.4 (binding medium). The final pool of l^Sl-TNF was diluted to a 
working stock solution of 1 x lO'^ M in binding medium and stored for up to one 
month at 4''C without detectable loss of receptor binding activity. The specific activity 
is routinely 1 x 10^ cpm/mmole TNF. 

30 Binding to Intact Cells. Binding assays with intact cells were performed by 

two methods. In the first method, cells were first grown either in suspension (e.g., U 
937) or by adherence on tissue culture plates (e.g., WI26-VA4 or COS cells expressing 
the recombinant TNF receptor). Adherent cells were subsequently removed by 
treatment with 5mM EDTA treatment for ten minutes at 37 degrees centigrade. Binding 

35 assays were then performed by a pthalate oil separation method (Dower et al., /. 
Immunol 132:151, 1984) essentially as described by Park et al. (/. BioL Chem. 
267:4177, 1986). Non-specific binding of 125I-TNF was measured in the presence of 
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a 200-fold or greater molar excess of unlabeled TNF. Sodium azide (0.29c) was 
included in a binding assay to inhibit internalization of l^-I-TNF by cells. In the 
second method, COS cells transfecied with the TNF-R-coniaining plasmid, and 
expressing TNF receptors on the surface, were tested for the abihty to bind ^^^I-TNF 
5 by the plate binding assay described by Sims et al. {Science 247:585, 1988). 

C. Solid Phase Binding Assays. The ability of TNF-R to be stably adsorbed to 
nitrocellulose from detergent extracts of human cells yet retain TNF-binding activit>' 
provided a means of detecting TNF-R. Cell extracts were prepared by mixing a cell 
pellet with a 2 x volume of PBS containing 1% Triton X-100 and a cocktail of protease 

10 inhibitors (2 mM phenylmethyl sulfonyl fluoride, 10 pepstatin, 10 ^M leupeptin, 2 
mM o-phenanthroline and 2 mM EGTA) by vigorous vortexing. The mixture was 
incubated on ice for 30 minutes after which it was centrifuged at 12,000x g for 15 
minutes at 8*C to remove nuclei and other debris. Two microliter aliquots of cell 
extracts were placed on dry BA85/21 nitrocellulose membranes (Schleicher and 

15 Schuell, Keene, NH) and allowed to dry. The membranes were incubated in tissue 
culture dishes for 30 minutes in Tris (0.05 M) buffered saline (0.15 M) pH 7.5 
containing 3% w/v BSA to block nonspecific binding sites. The membrane was then 
covered with 5 x lO'^l M I^Sj.tNT in PBS + 3% BSA and incubated for 2 hr at 4^C 
with shaking. At the end of this time, the membranes were washed 3 times in PBS, 

20 dried and placed on Kodak X-Omat AR film for 18 hr at -70X. 

D. Signal Transduction Assays. Inhibition of TNF signal transduction 
activity can be determined by transfecting cells with recombinant TNF-R DNAs 
encoding membrane-bound TNF-R to obtain recombinant receptor expression on the 
cell surface. The cells are then contacted with TNT and the resulting metabolic effects 

25 examined. If an effect results which is attributable to the action of the ligand, and is not 
attributable to endogenous TNF receptors on the cells, then the recombinant receptor 
has signal transduction activity. Exemplarj' procedures for determining whether a 
polypeptide has signal transduction activity are disclosed by Idzerda et al., J. Exp. 
Med. 777:861 (1990); Curtis et al., Proc. Natl. Acad. Sci. USA 56:3045 (1989); 

30 Prywes et al., EMBO J. 5:2179 (1986) and Chou et al., /. BioL Chem. 262:1842 
(1987). The ability of a soluble TNT-R polypeptide to competitively inhibit signal 
transduction can be determined using similar procedures. Primary cells or cell lines 
which express an endogenous TNT receptor and have a detectable biological response 
to TNF could be utilized as an alternative to the cells expressing recombinant 

35 membrane-bound TNF-R. Decreased signal transduction when a soluble TNF-R 
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polypeptide is added to the assay indicates binding of TNF by the soluble TNF-R, so 
that less TNF binds to the cell surface TNF receptors to initiate signal transduction. 



A. Radiolabeling of rIL-lfi, Recombinant human EL- IB was prepared by 
expression in £. coli and purification to homogeneity as described by Kronheim et al. 
(Bio/Technology 4:1078, 1986). The IL-IB was labeled with di-iodo (^^Sj) Bolton- 
Hunter reagent (New England Nuclear, Glenolden, PA). Ten micrograms (0.57 mnol) 
of protein in 10 uL of phosphate (0.015 mol/L)-buffeTed saline (PBS; 0.15 mol/L), pH 
7.2, was mixed with 10 uL of sodium borate (0.1 mol/L)-buffered saline (0.15 mol/L), 
pH 8.5, and reacted with 1 mCi (0.23 nmol) of Bolton-Hunter reagent according to the 
manufacturer's insuiictions for 12 hours at 8'C. Subsequently, 30 uL of 2% gelatin 
and 5 uL of 1 mol/L glycine ethyl ester were added, and the protein was separated from 
unreacted Bolton-Hunter reagent on a 1 mL bed volume Biogel™ P6 column (BioRad 
Laboratories, Richmond, CA). Routinely, 50% to 60% incorporation of label was 
observed. Radioiodination yielded specific activities in the range of 1 x lO^^ to 5 x 
1015 q)m/mmol-l (0.4 to 2 atoms I per molecule protein), and sodium dodecyl sulfate- 
polyaciylamide gel electrophoresis (SDS/PAGE) revealed a single labeled polypeptide 
of 17.5 kD, consistant with previously reponed values for EL-l. The labeled protein 
was greater than 98% TCA precipitable, indicating that the 125i ^as covalendy bound 



B. Inhibition Binding Assay for Membrane-Bound IL-IR. "IL-1" refers 
collectively to IL-la and IL-lB. The binding inhibition constant of an IL-IR protein 
may be determined by inhibition binding assays in which varying concentrations of a 
competitor (IL-IB or IL-la) are incubated with a constant amount of radiolabeled IL-IB 
or IL-la and cells expressing die IL-IR. The non-radiolabeled competitor binds to the 
receptor and prevents the radiolabeled ligand from binding to die receptor. Binding 
assays were performed by a phdialate oil separation mediod essentially as described by 
Dower et al., J. Immunol. 132:151, 1984 and Park et al., /. BioL Chem, 261:4111, 
1986. Briefly, host cells expressing a membrane-bound recombinant IL-IR were 
incubated in six-well plates (Costar, Cambridge, MA) at 4X for 2 hours witii l^Sj-iL- 
IB in 1 ml binding medium (Roswell Park Memorial Institute (RPMI) 1640 medium 
combining 2% BSA, 20 mM hepes buffer, and 0.1% sodium azide, pH 7.2). Sodium 



Example 2 

IL-1 Binding Assays 
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azide was included to inhibit internalization and degradation of ^-^I-IL-l by cells at 
37'C. The plates were incubated on a g>Taiory shaker for 1 hour at 37*C. Replicate 
aliquots of the incubation mixture were then transferred to polyethylene centrifuge tubes 
containing a phthalate oil mixture comprising 1.5 pans dibutylphthalate, to 1 part bis(s- 
5 . ethylhexyl)phthalate. Control tubes containing a lOOX molar excess of unlabeled EL- IB 
were also included to determine non-specific binding, The cells with bound ^^^I-IL-l 
were separated from unbound ^'^^l-JL-l by centrifugation for 5 minutes at 15,00OX g 
in an Eppendorf Microfuge. The radioactivity associated with the cells was then 
determined on a gamma counter. 

10 C. Inhibition Binding Assay for Soluble IL-IR. The binding inhibition 

constant of a soluble human EL-IR may be determined by an inhibition binding assay in 
which var>'ing concentrations of an IL-16 competitor are incubated with a constant 
amount of radiolabeled I-IL-IB and CB23 cells (an Epstein Ban* varus transformed cord 
blood B lymphocyte cell line) expressing the type II IL-IR. A cell line expressing 

15 endogenous tv^^e I IL-1 receptors may be substituted for the CB23 cells in assays 
involving soluble type I IL-IR. Binding assays were performed by a phthalate oil 
separation method essentially as described by Dower et al., J. Immunol 132:151, 1984 
and Park et al., J. Biol Chem. 267:4177, 1986. Briefly, CVI-EBNA (mammalian) 
cells were transfected with the expression vector pDC406 containing a cDNA encoding 

20 a soluble human type 11 IL-IR as described in example 10. Supematants from the cells 
were harvested 3 days after transfecnon and serially diluted in binding medium 
(Roswell Park Memorial Institute (RPMI) 1640 medium containing 2% BSA, 20 mM 
Hepes buffer, and 0.2% sodium azide, pH 7.2) in 6 well plates to a volume of 50 
M-l/well. The supematants were incubated with 50 ^l1 of 9 x lO'^O m 125i-IL-1B plus 

25 2.5 X 106 CB23 cells at S^C for 2 hours with agitation. Duplicate 60 ul aliquots of the 
incubation mixture were then transferred to polyethylene centrifuge types containing a 
phthalate oil mixture comprising 1.5 parts dibutylphthalate, to 1 part bis(s- 
ethylhexyl)phthalate. A negative control tube containing 3 x lO'^ M unlabeled EL-IB 
was also included to determine non-specific binding (100% inhibition) and a positive 

30 control tube containing 50 ml binding medium with only radiolabeled IL-16 was 
included to determine maximum binding. The cells with bound ^^^I-JL-IR were 
separated from unbound ^-^I-TL-IQ by centrifugation for 5 minutes at 15,000 X g in an 
Eppendorf Microfuge. Supematants containing unbound ^'^^I-IL-IB were discarded 
and the cells were carefully rinsed with ice-cold binding medium. The cells were 

35 incubated in 1 ml of tnr-psin-EDTA at 37'C for 15 minutes and then harvested. The 
radioactivity associated wth the cells was then determined on a gamma counter. The 
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ability of soluble IL-IR to inhibit binding of IL-la to endogenous cellular receptors 
may be determined by the same procedure. Analogous techniques may be employed in 
assays involving soluble TNF-R. 



Isolation of Human TNF-R cDNA by Direct Expression of Active Protein in COS-7 

CeUs 

Various human cell lines were screened for expression of TNF-R based on theu- 

10 ability to bind l^Sj^abeled TNF. The human fibroblast cell line WI-26 VA4 (ATCC 
CCL 95.1)was found to express a reasonable number of receptors per cell. 
Equilibrium binding studies showed that the cell line exhibited biphasic bmding of ^^I- 
TNF with approximately 4,000 high affinity sites (Ka = 1 x 10^0 and 15,000 low 
affinity sites (Ka = 1 x 10^ M-^) per cell. 

15 An unsized cDNA library was constructed by reverse transcription of 

polyadenylated mRNA isolated &om total RNA extracted from human fibroblast WI-26 
VA4 cells grown in the presence of pokeweed mitogen using standard techniques 
CGubler, et al.. Gene 25:263, 1983; Ausubel et aL, eds.. Current Protocols in 
Molecular Biology, VoL 1, 1987). The cells were harvested by lysing the cells in a 

20 guanidine hydrochloride solution and total RNA isolated as previously described 
(March et al„ Nature 315:641, 1985). 

Poly A"^ RNA was isolated by oligo dT cellulose chromatography and double- 
stranded cDNA was prepared by a method similar to that of Gubler and Hoffman (Gene 
25:263, 1983). Briefly, the poly A+ RNA was convened to an RNA-cDNA hybrid by 

25 reverse transcriptase using oligo dT as a primer. The RNA-cDNA hybrid was then 
convened into double-stranded cDNA using RNAase H in combination with DNA 
polymerase I. The resulting double stranded cDNA was blunt-ended with T4 DNA 
polymerase. To the blunt-ended cDNA is added EcoRl linker-adapters (having internal 
Notl sites) which were phosphorylated on only one end (Invitrogen). The linker- 

30 adaptered cDNA was treated with T4 polynucleotide kinase to phosphorylate the 5' 
overhanging region of the linker-adapter and unligated linkers were removed by 
running the cDNA over a Sepharose CL4B column. The linker-adaptered cDNA was 
li gated to an equimolar concentration of EcoRl cut and dephosphorylated arms of 
bacteriophage XgtlO (Huynh et al, DNA Cloning: A Practical Approach, Glover, ed., 

35 IRL Press, pp. 49-78). The ligated DNA was packaged into phage panicles using a 
commercially available kit to generate a library of recombinants (Stratagene Cloning 
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Systems, San Diego, CA, USA). Recombinants were further amplified by plating 
phage on a bacterial lawn of E. coli strain c600(hfl*). 

Phage DNA was purirled from the resulting >.gtlO cDKA library and the cDNA 
inserts excised by digestion with the restriction enzyme fsotl. Following 
5 electrophoresis of the digest through an agarose gel, cDNAs greater than 2,000 bp w^ere 
isolated. 

The resulting cDNAs were ligated into the eukaryotic expression vector 
pCAV/NOT, which was designed to express cDNA sequences inserted at its multiple 
cloning site when transfected into mannmalian cells. pCAV/NOT was assembled from 

10 pDC201 (a derivative of pMLSV, previously described by Cosman et al., Nature 312: 
768, 1984), SV40 and cytomegalovirus DNA and comprises, in sequence with the 
direction of transcription from the origin of replication: (1) SV40 sequences from 
coordinates 5171-270 including the origin of replicarion, enhancer sequences and early 
and late promoters; (2) cytomegalovirus sequences including the promoter and enhancer 

15 regions (nucleotides 671 to +63 from the sequence published by Boechan et al. {Cell 
41:521, 1985); (3) adenovirus-2 sequences containing the first exon and part of the 
intron between the first and second exons of the tripartite leader, the second exon and 
part of the third exon of the triparute leader and a multiple cloning site (MCS) 
containing sites for Xhol, Kpnl, Smal, Notl and Bgll; (4) SV40 sequences from 

20 coordinates 4127-4100 and 2770-2533 that include the polyadenylation and termination 
signals for early transcription; (5) sequences derived from pBR322 and virus- 
associated sequences VAI and VAII of pDC201, with adenovirus sequences 10532- 
11156 containing the VAI and VAU genes, followed by pBR322 sequences from 4363- 
2486 and 1094-375 containing the ampicillin resistance gene and origin of replication. 

25 pCAV/NOT has been deposited with the American Type Culture Collection under 
accession no. ATCC 68014. 

The resulting Wa-26 VA4 cDNA library in pCAV/NOT was used to transform 
E. coli strain DH5a, and recombinants were plated to provide approximately 800 
colonies per plate and sufficient plates to provide approximately 50,000 total colonies 

30 per screen. Colonies were scraped from each plate, pooled, and plasmid DNA 
prepared from each pool The pooled DNA was then used to transfect a sub-confluent 
layer of monkey COS-7 cells using DEAE-dextran followed by chloroquine treatment, 
as described by Luthman et al. {NucL Acids Res, 77:1295, 1983) and McCutchan et al. 
(J. Natl. Cancer Inst. 47:351, 1986). The cells were then grown in culture for three 

35 days to permit transient expression of the insened sequences. After three days, cell 
culture supematants were discarded and the cell monolayers in each plate assayed for 
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TNF binding as follows. Three ml of binding medium containing 1.2 x 10"^^ M ^^^I- 
labeled FLAG®-TNF was added to each plate and the plates incubated at 4*C for 120 
minutes. This medium was then discarded, and each plate was washed once with cold 
binding medium (containing no labeled TNF) and twice with cold PBS. The edges of 
5 each plate were then broken off, leaving a flat disk which was contacted with X-ray 
film for 72 hours at -70*C using an intensifying screen as described by Sims et al., 
Science 241:5^5 (1988). TNF binding activity was visualized on the exposed films as 
a dark focus against a relatively uniform background 

After approximately 240,(KX) recombinants from the library had been screened 

10 in this manner, one transfectant pool was observed to provide TNF binding foci which 
were clearly apparent against the background exposure. A frozen stock of bacteria 
from the positive pool was then used to obtain plates of approximately 150 colonies. 
Replicas of these plates were made on nitrocellulose filters, and the plates were then 
scraped and plasmid DNA prepared and transfected as described above to identify a 

15 positive plate. Bacteria from individual colonies from the nitrocellulose replica of this 
plate were grown in 0.2 ml cultures, which were used to obtain plasmid DNA, which 
was transfected into COS-7 ceUs as described above. In this manner, a single clone 
was isolated which was capable of inducing expression of human TNF-R in COS cells. 
The expression vector pCA V/NOT containing this TNF-R cDNA has been deposited 

20 with the American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 
20852, USA (Accession No. 68088) under the name pCA V/NOT-TNF-R. 



Example 4 

Construction of cDNAs Encoding Soluble huTNF-RA235 

25 

A cDNA encoding a soluble huTNF-RA235 was constructed. The encoded 
protein comprises the sequence of amino acids -22 to 235 of Figure 2A. Processing of 
the signal sequence yields a protein having the sequence of amino acids 1 to 235 of 
Figure 2A. An 840 bp fragment was excised from pCAV/NOT-TNF-R with the 
30 restricrion enzymes Notl and Pvu2. Notl cuts at the multiple cloning site of 
pCAV/NOT-TNF-R and Pvu2 cuts within the TNF-R coding region 20 nucleotides 5' 
of the transmembrane region. In order to reconstruct the 3' end of the TNF-R 
sequences, two oligonucleotides were synthesized and annealed to create the following 
oligonucleotide linker: 



35 
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?vu2 Bamri: 3gl2 

SEQ ID NO: 9 C T 3AAG 3 3A 3 CACTGGC 3 A CIA£.GGATCCA 

GACTrCGCrCGrGACCGCTGATTCCTAGGTCTAG 
Ala31'jGlvSerThrGlyAsp£iiui 

5 

This oligonucleotide linker has lerminal Pvu2 and Bgl2 restriction sites, regenerates 20 
nucleotides of the TNF-R, followed by a teraiination codon (underlined) and a BamHl 
restriction site (for convenience in isolating the entire soluble TNF-R by Notl/BamHl 
digestion). This oligonucleotide was then ligated with the 840 bp Notl/Pvu2 TNF-R 
10 insert into Bgl2/Notl cut pCAV/NOT to yield psoIhuTNF-RA235/CAVNOT, which 
was transfected into COS-7 cells as described above. This expression vector induced 
expression of soluble human TNF-R which was capable of binding TNF. 



15 
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Example 5 

Construction of cDNAs Encoding Soluble huTNF-RA185 

A cDNA encoding a soluble huTNF-RA185 ha\dng the sequence of amino acids 
-22-185 of Figure 2A (or amino acids M85 upon processing of the signal sequence in 
an appropriate host cell) was constructed by excising a 640 bp fragment from 
pCAV/NOT- TNF-R with the restriction enzymes Notl and Bgl2. Notl cuts at the 
multiple cloning site of pCAV/NOT-TNT-R and Bgl2 cuts witiim the TNF-R coding 
region at nucleotide 637, which is 237 nucleotides 5' of the transmembrane region. 
The following oligonucleotide linkers were synthesized: 



Bgl2 

SEQ ID NO: IC 5 ' -GATCTGTAACGTGGTGGCCATCCCTGGGAATGCAAGCATGGATGC- 3 ' 

AGATT'GCACCACCGGTAGGGACCCTrACGTTCG 
2Q ^-eCysAsnValValAlalleProGivAsnAlaSerMetAspAla 

Notl 

SEQ ID NO: II 5'- AGTCTGCACGTCCACGTCCCCCACCCGGr^iAGC -3 » 

TACCTACGTCAGACGTGCAGGTGCAGGGGGTGGGCCACTCGCCGG 
ValCysThrSerThrSerPrcThrArcEnd 

35 

The above oligonucleotide linkers reconstruct the 3' end of tiie receptor molecule up to 
nucleotide 708, followed by a termination codon (underlined). These oligonucleotides 
were then ligated with tiie 640 bp Notl TNF-R insert into Notl cut pCA V/NOT to yield 
tiie expression vector psolTNTRAl 85/CAVNOT, which was transfected into COS-? 
40 cells as described above. This expression vector induced expression of soluble human 
TNF-R which was capable of bindins TNF. 
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Example 6 

Construction of cDNAs Encoding Soluble huTNF-RA163 

5 A cDNA encoding a soluble huTNF-RA163 having the sequence of amino acids 

-22-163 of Figure 2A (1-163 upon processing of the signal sequence) was constructed 
by excising a 640 bp fragment from from pCAV/NOT-TNF-R with the restriction 
enzymes Notl and Bgl2 as described in Example 4. The following oligonucleotide 
linkers were synthesized: 

10 

Bgl2 Notl 
SEQ ID NO: 12 5 * -GATCTGTISAGC -3' 

ACAACTCGCCGG 
IleCysEnd 

15 

This above oligonucleotide linker reconstructs the 3' end of the receptor molecule up to 
nucleotide 642 (amino acid 163), followed by a termination codon (underlined). This 
oligonucleotide was then ligated with the 640 bp Notl TNF-R insert into Notl cut 
pCAV/NOT to yield the expression vector psolTNFRA163/CAVNOT, which was 
20 transfected into COS-7 cells as described above. This expression vector induced 
expression of soluble human TNF-R which was capable of binding TNF in the binding 
assay described in Example 1. 



Example 7 

Construction of cDNAs Encoding Soluble huTNF-RA142 

A cDNA encoding a soluble huTNF-RA142 (having the sequence of amino 
acids -22-142 of Figure 2A (1-142 upon processing of the signal sequence) was 
30 constructed by excising a 550 bp fragment from from pCAV/NOT-TNF-R with the 
restriction enzymes Notl and AlwNl. AlwNl cuts within the TNF-R coding region at 
nucleotide 549. The following oligonucleotide linker was synthesized: 

Bgl2 Notl 
SEQ ID NO: 13 5 * -CTGAAACATCAGACGTGGTGTGCAAGCCCTGTIA^-S ^ 

CTTGACTTTGTAGTCTGCACCACACGTTCGGGACAATTTCTAGA 

End 

This above oUgonucleodde linker reconstructs the 3^ end of the receptor molecule up to 
nucleotide 579 (amino acid 142), followed by a termination codon (underiined). This 
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oligonucleotide was then ligated with the 550 bp Notl/AlwKl TNF-R insen into 
Notl/Bgl2 cut pCAV/NOT to yield the expression vector psolTNFRA142/CAVNOT, 
which was transfected into COS-7 cells as described above. This expression vector did 
not induce expression of soluble human TNF-R which was capable of binding TNF. It 
5 is believed that this panicular construct failed to express biologically active TNF-R 
because one or more essential cysteine residues (e.g., Cys^^'^ or Cys^^^) required for 
intramolecular bonding (for formation of the proper tertiary structure of the TNF-R 
molecule) was eliminated. 



10 

Example 8 

Isolation of Human type I IL-IR cDNA Clones 



cDNA encoding a human type I IL-IR protein was isolated by hybridization to a 

15 probe derived from murine ly^ipc I IL-IR cDNA. Cloning of this murine EL-IR cDNA 
is described in example 4 of EP 318,296. A vector containing the murine cDNA was 
deposited with the American Type Culture Collection under the name GEMBL78 on 
November 19, 1987 and given the accession number ATCC 67563. 

A 2356 base pair (bp) fragment of this deposited murine clone 78 was isolated 

20 as described by Sims et al. {Science 247:585, 1988) and radiolabeled by nick- 
translation using DNA polymerase I for use as a probe. The method employed was 
substantially similar to that disclosed by Maniatis et al. {Molecular Cloning: A 
Laboratory Manual Cold Spring Harbor Laborator\', 1982, p. 109). 

The probe was used to screen human cDNA libraries for human IL-IR, as 

25 described by Sims et al., Proc. NatL Acad. Sci. (USA) 56:8946, 1989. A cDNA 
library' was constructed by reverse transcription of polyadenylated mRNA isolated from 
total RNA extracted from the cultured cells of a human T-cell line designated clone 22, 
described by Acres et al. (7. Immunol, 138:2132, 1987), These cells were cultured in 
RPMI 1640 medium plus 10% fetal bovdne serum as described by Acres et al. {supra), 

30 in the presence of 10 ng/ml OKT3 antibody and 10 ng/ml hunmn IL-2. The cDNA was 
rendered double-stranded using DNA polymerase I, blunt-ended with T4 DNA 
polymerase, methylated w^th EcoRI methylase to protect EcoRI cleavage sites within 
the cDNA, and ligated to EcoRI linkers. The resulting constructs were digested with 
EcoRI to remove all but one copy of the linkers at each end of the cDNA, and ligated to 

35 EcoRI-cut and dephosphor\'lated arms of bacteriophage AgtlO (Huynh et al., DNA 
Cloning: A Practical Approach, Glover, ed., IRL Press, pp. 49-78). The ligated DNA 
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was packaged into phage particles using a commercially available kit (Stratagene 
Cloning Systems, San Diego, CA, USA 92121) to generate a library of recombinants. 
Recombinants were plated on E. coli strain C600(hfl-) and screened by standard 
plaque hybridization techniques under conditions of moderate stringency (50°C, 6 x 



Following several rounds of screening, nine clones were isolated from the 
library which hybridized to the cDNA probe. The clones were plaque purified and used 
to prepare bacteriophage DNA which was digested with EcoRI. The digests were 
electrophoresed on an agarose gel, blotted onto nylon filters, and retested for 
hybridization. The clones were digested witii EcoRI followed by preparative agarose 
gel electrophoresis, tiien subcloned into an EcoRI-cut derivative (pGEMBL) of the 
standard cloning vector pBR322 containing a polyUnker having a unique EcoRI site, a 
BamHl site and numerous other unique restriction sites. An exemplary vector of this 
type is described by Dente et al. {hlucl. Acids Res. ii:1645, 1983). 

Restriction mapping and sequencing of a 4.8 kb human IL-IR clone indicated 
that the clone included a sequence encoding 518 amino acids which exhibited 80% 
amino acid sequence identity to the corresponding murine sequence in tiie extracellular, 
or N-terminal region distal to the transmembrane region, 63% identity in the 
transmembrane region, and 87% identity in the cytoplasmic, or C-terminal region. A 
440 bp EcoRI-Nsil fragment derived from the 5' portion of tiie human IL-lR clone was 
32p-labeled by nick-translation as described above and used to screen a cDNA library 
produced by randomly-priming human T cell line clone 22 mRNA prepared as 
described above. 23 clones which hybridized to the probe were isolated and analyzed 
by restriction mapping. Sequencing of one of these clones provided tiie sequence 
information corresponding to tiie remaining N-terminal 34 amino acids of the human 
protein. The DNA and deduced amino acid sequence of die complete coding region of 
tiie type I human IL-IR are shown in SEQ ID NOS:5 and 6. This human IL-IR protein 
comprises 569 amino acids (including a 20 amino acid signal peptide), and includes 16 
cysteine residues, 13 of which are conserved between tiie murine and human genes. In 
addition, tiie human sequence includes six potential N-glycosylation sites, of which 
five are conserved between murine and human. 



5 
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Example 9 

Isolation of cDNAs Encoding T\pe IT IL-IR 



10 



15 



20 



25 



30 



A DNA sequence encoding human t\pe II IL-IR was isolated from a cDNA 
librarj' prepared using standard methods, by reverse transcription of polyadenylated 
RNA isolated from the human B cell Kmphoblastoid line CB23, described by Benjamin 
& Dower, Blood 75:2017, 1990. Briefly, the CB23 cell line is an EBV-transformed 
cord blood (CB) lymphocyte cell line, which was derived using the methods described 
by Benjamin et al, Proc. Natl. Acad, Sci, USA 57:3547, 1984. 

The CB23 libran' was screened by modified direct expression of pooled 
cDNA fragments in the monkey kidney cell line CV-l/EBNA-1 using a mammalian 
expression vector (pDC406) that contains origins of replication derived from SV40, 
Epstein-Barr virus and pBR322. pDC406 is a derivative of HAV-EO described by 
Dower et al., J, Immunol, 742:4314 (1989). pDC406 differs from HAV-EO by the 
deletion of the intron present in the adenovirus 2 tripartite leader sequence in HAV-EO. 
The CV-l/EBNA-1 cell line was derived by transfection of the CV-1 cell line with the 
gene encoding Epstein-Barr virus nuclear antigen- 1 (EBNA-1) and with a vector 
containing CMV regulatory sequences, so that EBNA-1 is expressed under the control 
of the human CMV immediate-early enhancer/promoter. The EBNA-1 gene allow^s the 
episomal replication of expression vectors such as pDC406 that contain the EBV origin 
of replication. 

Transfectants expressing biologically active type II IL-IR were initially 
identified using a modified slide autoradiographic technique, substantially as described 
by Gearing et al., EMBO J. S:3667, 1989. Briefly, CV-l/EBNA-1 cells were 
transfected with miniprep DNA in pDC406 from pools of cDNA clones directly on 
glass slides and cultured for 2-3 days to permit transient expression of t>T>e 11 IL-IR. 
The slides containing the transfected cells were then incubated with medium containing 
^^^I-IL-IB, washed to remove unbound labeled IL-IB, fixed with gluteraldehyde, and 
dipped in liquid photographic emulsion and exposed in the dark. After developing the 
slides, they were individually examined with a microscope and positive cells expressing 
type n IL-IR were identirled by the presence of autoradiographic silver grains against a 
light background. 

Using this approach, approximately 250,000 cDNAs were screened in pools of 
approximately 3,000 cDNAs using the slide autoradiographic method until assay of one 
transfectant pool showed multiple cells clearly positive for IL-IB binding. This pool 
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was then partitioned into pools of 500 and again screened by slide autoradiography. A 
positive pool was identified. This pool was further partitioned into pools of 75 and 
screened by plate binding assays analyzed by quantitation of bound ^^^l-JLAQ. The 
cells were scraped off and counted to determine which pool of 75 was positive. 
5 Individual colonies from this pool of 75 were screened until a single clone was 
identified which directed synthesis of a surface protein with detectable IL-IB binding 
activity. This clone was isolated, and its insert was sequenced to determine the 
sequence of the human type II EL-IR cDNA that is presented along with the amino acid 
sequence encoded thereby in SEQ ID N0S:7 and 8. The pDC406 cloning vector 

10 containing the human type n IL-IR cDNA, designated pHu IL-IR-II 75. was deposited 
in Exoli host cells witii the American Type Culuire Collection. Rockville, MD. USA 
(ATCC) on June 5, 1990 under accession number ATCC 68337. The deposit was 
made under the conditions of the Budapest Treaty. 

Like most mammalian genes, mammalian type II IL-IR is presumably encoded 

15 by multi-exon genes. Alternative mRNA constructs which can be attributed to different 
raRNA splicing events following transcription, and which share large regions of 
identity or similarity with the cDNAs claimed herein, are considered to be within the 
scope of the present invention. 



20 Example 10 

Construction and Expression of cDNAs Encoding Human Soluble Type 11 IL-IR 

A cDNA encoding a soluble human type IIIL-IR (having the sequence of 
amino acids 

25 -13-333 of SEQ ID NO:8) was constructed by polymerase chain reaction (PGR) 
amplification using the full length type E IL-IR cDNA clone 75 (ATCC 68337) in 
vector pDC406 (described in example 9) as a template. The following 5' 
oligonucleotide primer (SEQ ID NO:14) and 3* oligonucleotide primer (SEQ ID NO: 15) 
were first constructed: 



SEQ ID NO: 14 5'-GCGTCGACCTAGTGACGCTCATACAAATC-3^ 

<SalI> 



35 



SEQ ID NO: 15 5 ^ -GCGCGGCCGCICAGGAGGAGGCTTCCTTGACTG-S ' 

<-NotI->End\1191 \1172 
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The 5' primer corresponds to nucleotides 31-51 from the unffanslated region of human 
t\T)t n IL-IR clone 75 (SEQ ED N0:7) with a 5' add-on of a Sail restriction site; this 
nucleotide sequence is capable of annealing to the (-) strand complementary' to 
nucleotides 31-51 of human clone 75. The 3' primer is complementar>' to nucleotides 
1191-1172 (.which includes anti-sense nucleotides encoding 3 amino acids of human 
type n IL-IR clone 75 and has a 5' add-on of a Nod restriction site and a stop codon. 

The following PGR reagents were added to a 1.5 ml Eppendorf microfuge tube: 
10 Ml of lOX PGR buffer (500 mM KCl, 100 mM Tris-HCl, pH 8.3 at 25'C, 15 mM 
MgCl2, and 1 mg/ml gelatin) (Perkins-Elmer Cctus. Norwalk, CN), 10 of a 2mM 
solution containing each dNTP (2 mM dATP, 2 mM dCTP, 2 mM dOTP and 2mM 
dTTP), 2.5 units (0.5 nl of standard 5000 units/ml solution) of Taq DNA polymerase 
(Perkins-Elmer Cetus), 50 ng of template DNA and 5 ^1 of a 20 ^lM solution of each of 
the above oligonucleotide primers and 74.5 ^1 water to a final volume of 100 ^1. The 
final mixture was then overiaid with 100 \d parafin oil. PGR was carried out using a 
15 DNA thermal cycler (Ericomp, San Diego, CA) by initially denaturing the template at 
94* for 90 seconds, reannealing at 55° for 75 seconds and extending the cDNA at 72° 
for 150 seconds. PGR was carried out for an additional 20 cycles of amplification 
using a step program (denaturation at 94*, 25 sec; annealing at 55°, 45 sec; extension at 
72°, 150 sec), followed by a 5 minute extension at 72*. 
20 The sample was removed from the parafin oil and DNA extracted by 

phenolchloroform extraction and spun column chromatography over G-50 (Boehringer 
Mannheim). A 10 m-I aliquot of the extracted DNA was separated by electrophoresis on 
1% SeaKem agarose (FMC BioProducts, Rockland, ME) and stained with ethidium 
bromide to confirm that the DNA fragment size was consistent with the predicted 
25 product. 

20 M-1 of the PGR-amplified cDNA products were then digested with Sail and 
Not! restriction enzymes using standard procedures. The Sall/NotI restriction fragment 
was then separated on a 1.2% Seaplaque™ low gelling temperature (LGT) agarose, and 
the band representing the fragment was isolated. The fragment was ligated into the 
30 pDC406 vector by a standard "in gel" ligation metiiod. The resulting vector was 
transfected into GVl-EBNA cells and the soluble IL-IR protein was expressed. 
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Example 11 

Construction of Vector Encoding Di-TNF-R 

5 A vector encoding a fusion protein of the formula TNF-R - peptide 

linker - TNF-R and depicted in figure 3 was constructed as follows. Pertinent 
restriction enzyme cleavage sites in the TNF-R sequence are also shown in figures 2A- 
2B. 

The expression vector constructed in example 4 and designated psol huTNF- 
10 RA235/CAVNOT was digested with the restriction enzyme Not I, which cuts at the 
multiple cloning site of the pCAV/NOT vector (i.e., upstream of the TNF-R sequence 
inserted therein). The overhang generated by Not I digestion was filled in using the 
Klenow fragment of DNA polymerase I to produce a blunt end. The vector was then 
digested with Bam HI which cleaves downstream of the stop codon that follows the 
15 codon for amino acid 235, as shown in example 4. 

The blunted Notl/Bam HI fragment containing the TNF-R sequence was 
isolated by conventional procedures and inserted into a plasmid vector designated 
pCAV/DHFR which had been digested with Sma I and Bgl n. The pCAV/DHFR 
vector is an expression vector containing SV40 promoter sequences upstream of a 
20 multiple cloning site and other features as described for pCAV/NOT in example 3, and 
also contains a dihydrofolate reductase (DHFR) gene as a selectable marker. The 
DHFR gene confers a selective advantage on otherwise DHFR- mammalian cells that 
have taken up the vector, when grown in the presence of methotrexate (MTX). Sma I 
digestion produces blunt ends, to which the blunted Not I ends of the TNF-R- 
25 containing fragment are ligated. 1 ne Bgl Il-generated overhangs are ligated to the Bam 
Hl-digested ends of the TNF-R-containing fragment. The ligation destroys the Bam HI 
and Bgl H sites. E. coli cells are transformed with the ligation mixture by conventional 
procedures. Plasmid DNA is recovered from tiie host ceUs and die desired construct is 
confirmed by restriction analysis. The resulting vector containing the TNF-R insen is 
30 designated pCAV/DHFR/TNF-R. 

The following DNA fragments were isolated for use in preparing a vector 
encoding two TNF-R polypeptides separated by a peptide linker: 

(A) Asp71 8 (a restriction enzyme) to Esp I fragment of the pCAV/DHFR/TNF- 
R vector. Asp 7 1 8 cleaves tiie vector upstream of the insened TNF-R sequence; Espl 
35 (available from U.S . Biochemicals) cleaves the TNT-R sequence at tiie position shown 
in Figure 2A. The desired fragment is about 6.7 kilobase-pairs (kbp) in lengtii and 
includes vector sequences and tiie 3' end of a TNF-R sequence. 
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(B ) Asp718 to Pv^uII fragment of the expression vector constructed in example 
4 and designated psol hu TNF-R A235/CAVNOT. Asp718 cleaves the vector upstream 
of the insened TNT-R sequence. The desired 865 bp fragment includes a TNF-R 
sequence extending from the 5' signal sequence through the P\'U II site shown in 
5 example 4. 

(C ) a double-stranded oligonucleotide having the sequence (SEQ ID N0S:16 
and 17): 



5 ' CTGAAGGGAGCACTGGCGACGGTGGCGGTGGATCCGGCGGTGGCGGCG3CTCATTGCCCGCCCAGG 3 ' 
5 ' GACTTCCCTCGTGACCGCTGGCACCGCGACCTAGGCCGGCACCGCCGCCGAGTAACGGGCGGG 5 ' 

GluGiySerThrGlyAspGlyGlyGlyGlySerGlyGlyGlyGlyGlySerLeuProAlaGlnVal 



(D) Bgl I to Esp I fragment of the expression vector constructed in example 4 
and designated psol hu TNF-R A235/CAVNOT. The desired fragment is about 304 bp 

15 in length. Bgl I cleaves within the codon for amino acid 5 (Val) of TNF-R; Esp I 
cleaves the TNF-R sequence at the position shown in Figure 2A; the remainder (3' end) 
of this soluble TNF-R-encoding sequence is provided by fragment (A). 

The oligonucleotide (C) is prepared by conventional procedures for chemical 
synthesis of oligonucleotides. The oligonucleotide reconstructs the 3' end of the first 

20 TNF-R sequence from the P\^u II site tiirough the last amino acid of the extracellular 
domain (Asp at position 235). The oligonucleotide also contains an in-frame sequence 
encoding the peptide linker Gly^SerGIysSer. The sequence of this ponion of the 
oligonucleotide may be varied if desired to encode other peptide linkers. 
Oligonucleotide C also reconstructs the 5' end of the second TNT-R sequence from a 

25 codon for leucine, the first amino acid of the mature protein, through a partial codon for 
valine (amino acid 5) within the protruding 3' overhang, which will regenerate the Val 
codon when ligated to the complementary overhang on the Bgl I-digested end of 
fragment D. 

The DNA fragments designated A-D were ligated togetiier in the positions 
30 shown in figure 3 to form a vector designated pCA V DHFR Di-TNT-R which encodes 
a fusion protein of the present invention. £. coli cells are transformed with the ligation 
mixture by conventional procedures. Plasmid DNA is recovered from the host cells 
and the desired construct is confirmed by restriction analysis. The upstream TNT-R 
pol\7)eptide encoded by this expression vector contains amino acids -22 to 235 of SEQ 
35 ID NO:2 (i.e., a soluble TNT-R including the N-terminal signal sequence and the entire 
extracellular domain v^iihout a stop codon). The downstream TNT-R poK^^epride lacks 
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the signal sequence and contains amino acids 1-235 of SEQ ID NO:2, with a stop 
codon positioned immediately after amino acid 235. The peptide linker of this 
particular construct is Gly4SerGly5Ser. 

Mammalian cells are transfected with the expression vector by conventional 
5 procedures and cultured to produce the desired fusion protein. One suitable mammalian 
cell line is aDHFR" Chinese hamster ovary cell line designated CHO-Kl and available 
&Dm the American Type Culture Collection, Rockville, MD, under accession number 
CCL61. The cells may be transfected with the expression vector by standard calcium 
phosphate precipitation, essentially as described by Graham and van der Eb, Virology 

10 52:456 (1983). The transfected cells are cultured under suitable conventional 
conditions, and the presence of the desired fusion protein in the culture medium is 
confirmed by assays such as those described in examples 1 and 2. 

Another suitable mammalian cell line is the COS-7 cell line. In one experiment, 
COS-7 cells were transfected with the TNF-R dimer-encoding expression vector 

15 described above and cultured to allow expression of the dimer. Unconcentrated 
supernatant (containing the secreted dimer) was assayed for the abiUty to inhibit binding 
of radioiodinated murine TNFa to U937 cell (a human monocyte-like cell line bearing 
endogenous receptors for TNF). The inhibition binding assay was conducted 
according to conventional procedures, similar to those described in example 2, section 

20 C. Negative control tubes (to measure non-specific binding) contained non- 
radiolabeled TNFa, U937 cells, and radioiodinated TNFa. Positive control njbes (to 
measure maximum, uninhibited binding of radioiodinated TNFa to the U937 cells) 
contained binding medium, U937 cells, and radioiodinated TNFa. The dimer- 
containing supernatant inhibited over 90% of tht TNF binding to U937 cells that v^as 

25 seen for the positive control. 



A plasmid vector containing DNA encoding a fusion protein of the formula EL- 
IR-peptide linker-TNF-R-peptide linker-TNF-R is constructed as follows. 

cDNA encoding a soluble type I IL-IR polypeptide was isolated and amplified 
using the well known polymerase chain reaction (PCR) procedure. The following 
35 oligonucleotides were synthesized for use as primers in the PCR reaction: 



30 



Example 12 

Fusion Protein Comprising One IL-lR Polypeptide and 
Two TNF-R Polypeptides 



SEQ ID NO: 18 5' ACCGAGGGACCTGAGCG 3» 



wo 93/197" 



43 



PCT/L'S93/02938 



se:: id nd: 19 3' tcaattatatag3tcagtga::caccgccac:ctaggccgccaccgc 



These oligonucleotides as well as those discussed below are synthesized by 
conventional procedures, e.g., by using an automated DNA synthesis machine such as 
5 those available from Biosearch, Inc., San Rafael, California or Applied Biosysiems. 
The template employed in the PGR reaction is a plasmid vector prepared by inserting 
type I human IL-IR cDNA into a vector designated SF CAV. The SF CAV vector is a 
mammalian expression vector shown in figure 5 (which depicts the use of SF CAV in 
an additional vector construction described below.) SF CAV in £. coli cells was 

10 deposited with the American Type Culture Collection on Februar\' 27, 1992, under the 
terms of the Budapest Treat)', and was given accession number 68922. 

The SV40, CMV, pA, and VA sequences and ampicillin resistance gene in SF 
CAV are as described for pCAV/NOT in example 3 above and also in example 8 and 
figure 3 of PCT application WO 90/05183. A multiple cloning site positioned between 

15 the adenovirus-2 tripartite leader (TPL) and pA sequences contains recognition sites for 
the restriction endonucleases Xhol, Kpnl, Smal, Not!, and Bgll. The TPL sequence 
differs from that of pCAV/NOT in that a region believed to be detrimental to 
construction of IL-lR-encoding vectors has been deleted from the SF CAV TPL 
sequence. The adverse impact on IL-IR vectors may possibly be attributable to a 

20 cr\^ptic promoter in the undesirable sequence, from which undesired protein is 
expressed in £. coli. Low level expression of human IL-IR off the cr>^ptic promoter 
may be toxic to the bacteria. 

SF CAV is digested with Smal, which recognizes a unique restriction site 
within the multiple cloning site and produces blunt ends. A DNA fragment containing 

25 type I IL-IR cDNA is produced by Styl/Bglll digestion followed by filling in the 
overhangs using the Klenow fragment of DNA polymerase I to generate blunt ends. 
St>a cleaves at nucleotide 49 and Bgin cleaves at nucleotide 1997 of SEQ ID NO;5. 
The IL-IR cDNA fragment is ligated into the Smal-digested SF CAV vector, and £. 
coli cells are transformed with the ligation mixture by standard procedures. The 

30 resulting vector is recovered from the E. coli cells and used as the template in the PCR 
reaction. 

The 5' primer (SEQ ID NO: 18) corresponds to a 17-nucleotide sequence found 
in the vector upstream of the insened IL-IR cDNA. The 3' primer (SEQ ID NO: 19) 
includes a segment complementarv' to nucleotides 1060-1079 of SEQ ID N0:5, which 
35 encode amino acids 306 (partial codon) tiirough 312, near the C-terminus of the IL-IR 
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extracellular domain. This 3' primer also contains a sequence encoding the peptide 
linker Gly4SerGly5Ser. 

A PGR reaction is conducted using any suitable procedure, such as those 
described in Sarki et al.. Science 259:487 (1988); in Recombinant DNA Methodology, 
5 Wu et al., eds.. Academic Press Inc., San Diego (1989), pp. 189-196; and in PCR 
Protocols: A Guide to Methods and Applications, Innis et al., eds., Academic Press, 
Inc. (1990). An example of a suitable PCR procedure is as follows. All temperatures 
are in degrees centigrade. The following PCR reagents are added to a 0.5 ml 
Eppendorf microfuge tube: 10 ^1 of lOX PCR buffer (500 mM KCl, 100 mM Tris- 

10 HCI, pH 8.3 at 25*C, 25 mM MgCh, and 1 mg/ml gelatin) (Perkin-Elmer Cetus, 
Norwalk, CN), 8 ^I of a 2,5 mM solution containing each dNTP (2 mM dATP, 2 mM 
dCTP, 2 mM dGTP and 2 mM dTTP), 2.5 units (0.5 ^1 of standard 5000 units/ml 
solution) of Taq DNA polymerase (Perkins-Elmer Cetus), 1 ng of template DNA, 100 
picomoles of each of the oligonucleotide primers, and water to a final volume of 100 pi. 

15 The final mixture is then overlaid with 100 parafin oil. PCR is carried out using a 
DNA thermal cycler (Ericomp, San Diego, CA). The template is denatured at 94* for 5 
minutes and PCR is carried out for 25 cycles of amplification using a step program 
Cdenaturation at 94% 1.5 minutes; annealing at 60% 1 minute; extension at 72% 1 
minute). 

20 Electrophoresis of an aliquot of the reaction mixture on 1% SeaKem low 

melting temperature (LMT) agarose (FMC BioProducts, Rockland, ME) and staining 
witii ethidium bromide visualizes a single DNA fragment of the expected size as the 
PCR reaction product. The PCR-amplified DNA fragment comprises a shon vector 
sequence tiiat includes an Asp718 resuiction site, upstream of a sequence encoding IL- 

25 IR amino acids l(Asp) to 312 (Thr), foUowed by a single stranded segment encoding 
the peptide linker. 

A second PCR reaction is conducted to isolate and amplify a cDNA fragment 
encoding a soluble TNF-R polypeptide. The template was the vector designated 
psolhuTNF-R A235/CAVNOT in example 4, which contains a cDNA insert encoding 
30 TNF-R amino acids -22 to 235 of SEQ ID NO:L The primers employed in the reaction 



SEQ ID NO: 20 5' GGTGGCGGTGGATCCGGCGGTGGCGGCGGCTCATTGCCCGCCCAGGTGGCA 3' 



are: 



35 



SEQ ID NO: 21 3' TGACGCGCGACTCGTTCG 5* 
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The 5' primer (SEQ ID NO;20) comprises a peptide linker encoding segment 
complementar\' lo the peptide linker encoding ponion of the SEQ ED NO: 19 primer. 
The linker-encoding segment is followed by codons for the first six amino acids of 
mature TNF-R (Leu through Ala). 
^ The 3' primer (SEQ ID N0:21) comprises nucleotides complementary to 

nucleotides 461-478 of TNF-R SEQ ID NO:l, which encode amino acids 103 (Tyr, 
partial codon) through 109 (Gin, partial codon). This primer also encompasses an Espl 
restriction site that is naturally present in this portion of the TNT-R protein. 

The PGR reaction procedure is as described above. The DNA fragment 

10 amplified by this second PGR reaction comprises ihc above-described linker-encoding 
segment followed by a sequence encoding amino acids 1 (Leu) through 109 (Gin, 
panial codon) of TNF-R. This DNA fragment is visualized by electrophoresis 
followed by ethidium bromide staining of the gel, as described above. A third PGR 
reaction is conducted to isolate an amplified double stranded DNA fragment comprising 

15 IL-IR and TNT-R sequences separated by the linker sequence. The following reagents 
were combined in a 0.5 ml. tube: 

3 nl (about 5-10 ng) of the IL-lR-peptide linker DNA fragment 
amplified in the first PGR reaction above - a 3 ^il aliquot is taken directly 
20 from the LMT agarose by micropipette, using VW light to \dsualize the 

desired band on the ethidium bromide stained gel 

3 Ml (about 5-10 ng) of the peptide linker-TNT-R DNA fragment 
amplified in the second PGR reaction above - a 3 ^1 aliquot is 
micropipetted directly from the region of the LMT agarose gel that 

-5 contains the desired band 

10 ^il of 10 X PGR buffer (described above) 

lOOpmole of the SEQ ED NO: 18 oligonucleotide as the 5' primer 

100 pmole of the SEQ JD N0:21 oligonucleotide as the 3' primer 

4 ^] of a 2.5 mM solution containing each of the four dNTPs 
^'^ 0.5 ^1 of Taq DNA polymerase (5 units/^1) 

water to a final volume of 100 ^1 

The PGR reaction cycles are conducted at tiie temperatures and for the time 
periods specified above. After the initial denaturing step, the complementar>' peptide 
35 linker^encoding segments of tiie two DNA fragments anneal. The end product of the 
reaction is a blunt-ended double-stranded amplified DNA fragment about 1370 bp in 



wo 93/19777 PCr/US93/02938 

W 46 0 

length, comprising an IL-IR DNA sequence upstream of a sequence encoding a peptide 
linker, followed by a TNF-RDNA sequence. 

A 25 ^l aliquot of this third PGR reaction mixture is, without purification, 
reacted with the restriction enzymes Asp718 and Espl. Asp 718 cleaves upstream of 
5 the IL-IR sequence, as discussed above. Espl cleaves within the TNF-R sequence, as 
shown in Figure 2A and discussed above. The restriction endonuclease Cell n is an 
isoschizomer of Espl and may be substituted for Espl. The desu-ed fragment, referred 
to as fragment E hereinafter, is purified by conventional procedures such as separation 
by gel electrophoresis, e.g., on a 1.0% Seaplaque low melting temperature agarose gel, 
10 and isolation of the band representing the desired fragment 

Two additional DNA fragments designated F and G are isolated and joined with 
fragment E to constmct an expression vector having a second TNF-R sequence fused 
(via a peptide linker sequence) downstream of the IL-lR-iinker-TNF-R DNA fragment 
prepared above. The resulting vector and the positions of fragments E, F, and G 
15 contained therein are depicted in figure 4. 

The DNA fragment designated F is prepared by digesting the vector depicted in 
Figure 3 and constructed in example 1 1 with Espl. A fragment containing a 3' portion 
of TNF-R (extending from the Espl site shown in Figure 2A through a codon for 
amino acid 235) followed by a sequence encoding a Gly4SerGly5Ser peptide linker that 
20 is followed by a second TNF-R sequence extending from the codon for amino acid 1 
(Leu) to the Espl site in the second (downstream) TNF-R sequence of the figure 3 
vector, is isolated. This fragment, about 739 base pairs in length, is designated 
fragment F hereinafter. 

Fragment G, containing vector sequences (including DHFR) and a 3' portion of 
25 a TNF-R sequence is isolated from a "splice free" vector as follows. The 
pCAV/DHFR>rrNF-R vector prepared in example 11 contains a sequence which is 
believed to be disadvantageous for construction of IL-lR-encoding vectors, possibly 
because of a cryptic promoter wid:un tiiis sequence from which undesired protein is 
expressed in £. colL A derivative of pCAV/DHFR/TNF-R was prepared by replacing 
an Ndel/Asp 718 vector fragment that contains die undesirable sequence with an 
NdeT/Asp 718 fragment from splice free vector SF CAV (ATCC 68922, described 
above). The construction is depicted in figure 5. 

SF CAV is digested with Ndel and Asp 718 (unique sites in this plasmid) and 
the fragment of about 500 bp labeled H in figure 5 is isolated. Fragment H lacks the 
undesired sequence found in the corresponding fragment of pCAV/DHFR/TNF-R. 
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Tne pCAV/DHFR/TNF-R vector prepared in example 1 1 and shown in figure 5 
is digested with Asp 718, Espl, and Ndel. An Asp 718/EspI fragment of about 495 bp 
is isolated (fragment I). An Xdel/Espl fragment of about 4647 bp is also isolated 
(fragment J). 

5 DXA fragments H, I, and J prepared above are ligated together to form the 

splice free vector SF CAV/DHFR/TNF-R shown in figure 5. £. coli cells are 
transformed with the ligation mixture by conventional procedures. Plasmid DNA is 
recovered from the host cells and the desired construct is confirmed by restriction 
analysis. SF pCAV/DHFR/TNF-R is then digested with Asp718 and Espl. The 

10 fragment designated G in figure 5 contains vector sequences (including DHFR) and the 
3' end of a TNF-R sequence (from the internal Espl site through the codon for amino 
acid 235 followed by a stop codon) and is isolated by conventional procedures. 

DNA fragments E, F, and G prepared above are ligated together to form the 
vector depicted in figure 4 (and designated SF CAV DHFR tri-R), £. coli cells are 

15 transformed with the ligation mixture and the desired plasmid is recovered as described 
above. The fusion protein encoded by this vector comprises (from N- to C- teroiinus) 
amino acids -20 to 312 of type I IL-IR (SEQ JD N0:5); a Gly4SerGly5Ser peptide 
linker; amino acids 1 to 235 of TNF-R (SEQ ID N0:1); a Gly4SerGly5Ser peptide 
linker; and a second TNF-R polypeptide comprising amino acids 1 to 235 of SEQ ID 

20 N0:1. 

Mammalian cells are transfected with the expression vector by conventional 
procedures and cultured to produce the desired fusion protein. One suitable mammalian 
cell line is a DHFR- Chinese hamster ovar\' cell line designated CHO-Kl and available 
firom the American Type Culture Collection, Rockville, NID, under accession number 

25 CCL61. The cells may be transfected wdth the expression vector bv standard calcium 
phosphate precipitation, essentially as described by Graham and van der Eb, Virology 
52:456 (1983). The transfected cells are cultured under suitable conventional 
conditions, and the presence of the desired fusion protein in the culture medium is 
confirmed by assays such as those described in examples 1 and 2. 

30 DNA sequencing of an expression vector constructed as described above 

revealed two point mutations in the fusion protein-encoding sequences, which may 
have have occurred during PCR. The "T" at position 437 of SEQ ID NO:5 (type I IL- 
IR) was changed to a "C" and the"C" at position 687 of SEQ ID N0:1 (TNT-R) was 
changed to a *'T'. These mutations are silent, so the amino acid sequence of the IL-IR- 

35 peptide linker- TNF-R-peptide linker- TNT-R fusion protein (termed the ' mmeric 
receptor" hereinafter) is as described above. 
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COS-7 cells were transfected with the expression vector and cultured to allow 
expression of the trimeric receptor. 5X concentrated supernatant (containing the 
secreted trimeric receptor protein) was assayed for the ability to inhibit binding of 
radioiodinated murine TNFa (0.5 nM) to U937 cells (a human monocyte-like ceU line 
5 bearing endogenous receptors for TNF). The inhibition binding assay was conducted 
according to conventional procedures, similar to those described in example 2, section 
C. Negative control tubes (to measure non-specific binding) contained non- 
radiolabeled TNFo, U937 ceUs, and radioiodinated TNFa. Positive control mbes (to 
measure maximum, uninhibited binding of radioiodinated TNFa to tiie U937 cells) 
10 contained binding medium, U937 cells, and radioiodinated TNFa. The trimeric 
receptor-containing supernatant inhibited about 100% of the TNF binding to U937 cells 
that was seen for the positive control. 

The 5X concentrated trimeric receptor-containing supernatant also was assayed 
for the ability to inhibit binding of radioiodinated human IL-la (0.14 nM) to EL4 6.1 
15 cells, which bear endogenous IL-1 receptors. The EL4 6. 1 cell line was derived fix)m a 
murine thyoma cell line as described by MacDonald et al. {J. Immunol. 135:3944, 
1985). Negative control tubes (to measure non-specific binding) contained non- 
radiolabeled IL-la, EL4 6.1 ceUs, and radioiodinated IL-la. Positive control tubes (to 
measure maximum, uninhibited binding of radioiodinated IL-la to tiie EL4 6.1 cells) 
contained binding medium, EL4 6.1 cells, and radioiodinated IL-la. The trimeric 
receptor-containing supernatant inhibited 47% of the IL-1 binding to EL4 6.1 cells that 
was seen for the positive control. Supemanant from COS-7 cells expressing a 
monomeric soluble type I IL-IR, assayed as a control, inhibited 52% of the IL-1 
binding to EL4 6.1 cells that was seen for the positive control. 

25 
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30 



Example 13 

Fusion Protein Comprising One IL-IR P olvpentide and One TNF-R PolvpeDtirie 



Fragments E and G prepared in example 12 may be ligated together to form a 
vector containing a DNA sequence that encodes a fusion protein of the formula IL-IR- 
peptide linker-TNF-R, wherein the peptide linker is Gly4SerGly5Ser. IL-IR and TNF- 
R are soluble polypeptides as described in example 12. The fusion protein of example 
35 12 is prefen-ed due to the enhanced TNF binding achieved when two, ratiier than one, 
TNF-R polypeptides are employed. 
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COS-7 cells were transfected with the expression vector and cultured to allow 
expression of the IL-lR-peptide linker-TNF-R fusion protein. 5X concentrated 
supernatant (containing the secreted fusion protein) was assayed for the ability to inhibit 
binding of radioiodinated human EL- la (0.14 nM) to EL4 6.1 cells (a murine thyoma- 
derived eel! line bearing EL-l receptors as described in example 12), The inhibition 
binding assay procedure was similar to that described in example 2, secton C, and the 
positive and negative controls were as described in example 12. The fusion protein- 
containing supernatant inhibited 53% of the IL-1 binding to EL4 6.1 cells that was seen 
for the positive control. 



Additional Fusion Proteins 



The skilled artisan will appreciate that the techniques disclosed herein may be 
employed to produce additional fusion proteins of the present invention, beyond those 

15 illustrative embodiments presented in the foregoing examples. The peptide linker may 
be varied by synthesizing an oligonucleotide encoding a different peptide linker 
sequence, for example. Further, altemarive fragments of the IL-IR or TNF-R proteins 
may be isolated by employing PGR primers that anneal to a different desired portion of 
the disclosed DNA sequences thus defining the termini of the desired fragment. 

20 Oligonucleotides used to regenerate a terminus of a DNA fragment (e.g., the 
oligonucleotide employed in example 4) may be varied to position a stop codon after 
any desired amino acid. Funher, the choice of expression vector will depend upon the 
intended host cells. 



25 BRIEF DESCRIPTION OF T HE SEOTiFNCE LTSTTNO 

SEQ ID N0:1 and SEQ ID NO:2 show the nucleotide sequence and encoded 
amino acid sequence of a human TNF-R cDNA. The mature protein is defined by 
amino acids 1 - 439. The signal peptide is defined by amino acids -22 through -1. The 
30 transmembrane region is defined by amino acids 236 - 265. 

SEQ ID N0:3 and SEQ ID N0:4 show the nucleotide sequence and encoded 
amino acid sequence of the coding region of a human TNT-R cDNA. This TNF-R is a 
different protein from iht TNF-R of SEQ ID N0S:1 and 2. The mature protein is 
defined by amino acids 1-415. The signal peptide is defined by amino acids -40 
35 through -1. The transmembrane region is defined by amino acids 172 - 192. 
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SEQ ID NO:5 and SEQ ID NO:6 show the nucleotide sequence and encoded 
amino acid sequence of a human type I IL-IR cDNA. The mature protein is defined by 
amino acids 1 - 549. The predicted signal peptide is defined by amino acids -20 
through -1. The transmembrane region is defined by amino acids 317 - 336. 
5 SEQ ID NO:7 and SEQ ID NO:8 show the nucleotide sequence and encoded 

amino acid sequence of a human type U IL-IR cDNA. The mature protein is defined 
by amino acids 1 - 385. The predicted signal peptide is defined by amino acids -13 
through -L The transmembrane region is defined by amino acids 331 - 356. 

SEQ E) NOS:9-15 and 18-21 depict oligonucleotides employed in constructing 
10 various recombinant plasmids. as described in the examples section. 

SEQ ID NOS:16 and 17 depict an oligonucleotide and the amino acid sequence 
encoded thereby, which includes a Gly4SerGly5Ser peptide linker sequence. This 
oligonucleotide is employed in die vector construction described in example 1 1. 
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SEQUENCE LISTING 

(1) genera:^ IKrORK.3i.TION: 

(i) .^vPPLICANT: ScjLrh, Ora^g A. 

(ii) TITLE OF INVENTION: Fusicn Proteins ComprjLsi.ng Turner 
Necrcs-s Eactcr Receorcr 



(iii) NUMBER OF SEQUENCES: 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Iirmunex Corporation 

(3) STREET: 51 University Street 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: U.S.A. 

(F) ZIP: 96101 



(v) CO.MPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUM3ER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 479,661 

(B) FILING DATE: 07-FEB-1990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 523,635 

(B) FILING DATE: lC-MAY-:990 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 701,415 

(B) FILING DATE: 16-JUN-1991 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 821,716 

(B) FILING DATE: 14-JAN-1992 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seese, Kathryn A. 

(B) REGISTRATION NT^ER: 32,172 

(C) REFERENCE /DOCKET NTTMBER: 2502 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 567-0430 

(B) TELEFAX: (206) 587-0606 



2) INFORMATION FOR SEQ ID NC : 1 : 

(i) SEQU^ENCE CHARACTERISTICS: 
(A) LENGTH: 'if^C^- r,;:»^o 
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(C) STRSHdEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(vi) ORIGINAI. SOURCE: 

(A) ORGANISM:- Homo sapiens 

(G) CELL TYPE: Fibroblast 

(H) CELL LINE: WI-26 VA4 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: WI-26 VA4 

(B) CLONE: 1 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

CB) LOCATION: 88.. 1473 

(ix) FEATURE: 

(A) NAME/KEY: raat_peptide 

(B) LOCATION: 154.. 1470 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 88^.153 



(x) PUBLICATION INFORMATION: 

(A) AUTHORS: Smithy Craig A. 

Davis ^ Terri 
Anderson^ Dirk 
Solam, Lisabeth 
Beckmann^ M. P . 
Jerzy, Rita 
Dower, Steven K. 
Cosman, David 
Goodwin r Raymond - G . 
CB) TITLE: A Receptor for Tumor Necrosis Factor Defines 
an Unusual Family of Cellular and Viral Proteins 

(C) JOURNAL: Science 

(D) VOLUME: 248 

(F) PAGES: 1019-1023 

(G) DATE: 25-MAY-1990 



{y±) SEQUENCE DESCRIPTION: SEQ ID N0:1: 



GCGAGGCAGG CAGCCTGGAG AGAAGGCGCT GGGCTGCGAG GGCGCGAGGG CGCGAGGGCA 

GGGGGCAACC GGACCCCGCC CGCATCC ATG GCG CCC GTC GCC GTC TGG GCC 

Met Ala Pro Val Ala Val Tn; Ala 
"22 -20 " -15 

GCG CTG GCC GTC GGA CTG GAG CTC TGG GCT GCG GCG CAC GCC TT3 CCC 
Ala Leu AJ.a Val Gly Leu Glu Leu Tro Ala Ala Ala His i^l^ Le- ^-o 
-10 " -5 - ' 
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GCC CAG GTG GCA TTT kZff^ZZ TAC GCC CCG GAG CCC GGG AGC SIF^ TGC 2C" 

Ala Gin Val Ala ?he Thr Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys 
5 10 1*5 

CGG CTC AGA GAA TAC TAT GAC CAG ACA GCT GAG ATG TGC TGC AGC AA«A 2 55 

Arc: Leu Arg Glu Tvr Tvr Asp Glr. Thr Ala Gin Met Cys Cys Ser Lys 
20 ' " 25 30 

TGC TCG CCG GGC CAA CAT GCA AAA GTC TTC TGT ACC AAG ACC TCG GAC 3C3 
Cys Ser Pro Gly Gin Kis Ala Lys Val Phe Cys Thr Lys Thr Ser Asp 
35 ' 40 ^ 45 50 

ACC GTG TGT GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG AAC 3 51 

Thr Val Cvs Asis Ser Cys Glu Asp Ser Thr Tyr Thr Gin Leu Trp Asn 
55 ' 60 65 

TGG GTT CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT GAC CAG 3 99 

Trp Val Pro Glu Cvs Leu Ser Cys Gly Ser Arg Cys Ser Ser Asp Gin 
70 ' '75 ' ' 80 " 

GTG GAA ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC ACC TGC 4 47 

Val Glu Thr Gin Ala Cys Thr Arg Glu Gin Asn Arg lie Cys Thr Cys 
85 '90 95 ' 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AAG CAG GAG GGG TGC CGG CTG 4 95 

Arg Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys Arg Leu 
100 ' ' " '105 110 

TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC AGA CCA 5 43 

Cys Ala Pro Leu Arg Lys Cys Arg Pro Gly Phe Gly Val Ala Arg Pro 
115 120 125 ' 130 

GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC CCG GGG ACG 5 91 

Gly Thr Glu Thr Ser Asp Val Val Cys Lys Pro Cys Ala Pro Gly Thr 
135 140 145 

TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGC AGG CCC CAC CAG ATC 63 9 

Phe Ser Asn Thr Thr Ser Ser Thr Asp lie Cys Arg Pro His Gin lie 
150 155 160 

TGT AAC GTG GTG GCC ATC CCT GGG AAT GCA AGC ATG GAT GCA GTC TGC 5 £7 

Cys Asn Val Val Ala lie Pro Gly Asn Ala Ser Met Asp Ala Val Cys 
155 170 :75 

ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC CCA GGG GCA GTA CAC TTA 7 35 

Thr Ser Thr Ser Pro Thr Arg Ser Met Ala Pro Gly Ala Val His Leu 
18 0 IBS 190 

CCC CAG CCA GTG TCC ACA. CGA TCC CAA CAC ACG CAG CCA ACT CCA GAA 7B3 
Pro Gin Pro Val Ser Thr Arg Ser Gin His Thr Gin Pro Thr Pro Glu 
195 200 205 210 

CCC AGC ACT GCT CCA AGC ACC TCC TTC CTG CTC CCA ATG GGC CCC AGC 5 31 

Pro Ser Thr Ala Pre Ser Thr Ser Phe Leu Leu Pro Met Gly Pro Ser 
215 220 " 225 

CCC CCA GCT GAA GGG AGC ACT GGC GAC TTC GCT CTT CCA GTT GGA CTG =~9 
Pro Pro Ala Glu Gly Ser Tr.r Gly Asp Phe Ala Leu Pro Val Gly leu 
230 * ' 235 24C 
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ATT GTG GGT GTG ACA GCC TTG GGT CTA CTA ATA ATA GGA GTG GTG AAC 927 
lie Vai Gly Val Thr Ala Leu Gly Leu Leu lie lie Gly Val Val Asn 
245 250 255 

TGT GTC ATC ATG ACC CAG GTG AAA AAG AAG CCC TTG TGC CTG CAG AGA S7 5 

Cys Val lie Met Thr Gin Val Lys Lys Lys Pro Leu Cys Leu Gin Arg 
260 265 270 

GAA GCC AAG GTG CCT CAC TTG CCT GCC GAT AAG GCC CGG GGT ACA CAG 1023 
Glu Ala Lys Val Pro His Leu Pro Ala Asp Lys Ala Arg Gly Thr Gin 
275 " 280 285 290 

GGC CCC GAG CAG CAG CAC CTG CTG ATC ACA GCG CCG AGC TCC AGC AGC 1071 
Gly Pro Glu Gin Gin His Leu Leu lie Thr Ala Pro Ser Ser Ser Ser 
295 300 305 

AGC TCC CTG GAG AGC TCG GCC AGT GCG TTG GAC AGA AGG GCG CCC ACT 1119 
Ser Ser Leu Glu Ser Ser Ala Ser Ala Leu Asp Arg Arg Ala Pro Thr 
310 315 320 

CGG AAC CAG CCA CAG GCA CCA GGC GTG GAG GCC AGT GGG GCC GGG GAG 1167 
Arg Asn Gin Pro Gin Ala Pro Gly Val Glu Ala Ser Gly Ala Glv Glu 
325 330 335 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT GGG 1215 
Ala Arg Ala Ser Thr Gly Ser Ser Asp Ser Ser Pro Gly Gly His Gly 
340 345 ' 350 

ACC CAG GTC AAT GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC TCT GAC 12 63 

Thr Gin Val Asn Val Thr Cys lie Val Asn Val Cys Ser Ser Ser Asp 
355 360 365 370 

CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG GGA GAC ACA 1311 
His Ser Ser Gin Cys Ser Ser Gin Ala Ser Ser Thr Met Gly Asp Thr 
375 380 385 

GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG GTC CCC TTC TCC 13 59 

Asp Ser Ser Pro Ser Glu Ser Pro Lys Asp- Glu Gin Val Pro Phe Ser 
390 395 " 400 

AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG ACG CCA GAG ACC CTG 14 07 

Lys Glu Glu Cys Ala Phe Arg Ser Gin Leu Glu Thr Pro Glu Thr Leu 
405 410 415 

CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC CTT GGA GTG CCT GAT GCT 14 55 

Leu Gly Ser Thr Glu Glu Lys Pro Leu Pro Leu Gly Val Pro As? Ala 
420 425 430 

GGG ATG AAG CCC AGT TAACCAGGCC GGTGTGGGCT GTGTCGTAGC CAAGGTGGGC 1510 
Gly Met Lys Pro Ser 
435 440 

TGAGCCCTGG CAGGATGACC CT3CGAAGGG GCCCTGGTCC TTCCAGGCCC CCACCACTAG 1570 

GACTCTGAGG CTCTTTCTGG GCCAAGTTCC TCTAGTGCCC TCCACAGCCG CAGCCTCCCT lc3 0 

CTGACCTGCA G i c^- 
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(2) :NTORMAr:0.\' for sec K0:2: 

(i) SEQUENCE CHARACTERISriCS : 

(A) :.ENGTH: 4 61 arrlnc acids 
(5) TYPE: ardnc acid 
(T) TOPCLOGY: linear 

(ii) MOLECULE TYPE: proreir. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Pro Val Ala Val Trp Ala Ala Leu Ala Val Gly Leu Glu Leu 
-22 -20 -15 -ic 

Trp Ala Ala Ala His Ala Leu Pro Ala Gin Val Ala Phe Thr Pro Tyr 

- 5 io 

Ala Pro Glu Fro Gly Ser Thr Cys Arg Leu Arc Glu Tyr Tyr Asp Gin 

20 25 

Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Glv Gin His Ala Lys 
30 35 40 

Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys AsD Ser Cys Glu Asp 

50 ' 55 

Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val Pro Glu Cvs Leu Ser Cvs 
60 65 70 " 

Gly Ser Arg Cys Ser Ser Asp Gin Val Glu Thr Gin Ala Cys Thr Arc 

60 85 90 

Glu Gin Asn Arg lie Cys Thr Cys Arg Pro Gly Trp Tvr Cvs Ala Leu 
55 100 ' ' 105 

Ser Lys Gin Glu Gly Cys Arg Leu Cys Ala Pre Leu Arg Lvs Cys Arg 

115 120 

Pro Gly Phe Gly Val Ala Arg Pro Glv Thr Glu Thr Ser As:d Va- Va^ 
■ '30 135 

Cys Lys Pro Cys Ala Pro Gly Thr Phe Ser Asn Thr Thr Ser Ser -r^ - - 
-'^^ 145 150 

Asp lie Cys Arg Pro His Gin lie Cys Asn Val Val Ala lie ^ro Glv 
155 160 165 17b 

Asn Ala Ser Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arg Se- 
175 180 185 

Met Ala Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arg Se- 

155 200 

Gin His Thr Gin Pro Thr Pro Glu Pro Ser Thr Ala Pro Se- T^^r Se- 
205 210 215 

Phe Leu Leu Pro Met Gly Pro Ser Pro Pro Ala Glu Glv Ser Thr G'v 
220 225 230 
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Asp Phe Ala Leu Pro Val Gly Leu lie Val Gly Val Thr Ala Leu Gly 
235 240 245 250 

Leu Leu lie He Gly Val Val Asn Cys Val He Met Thr Gin Val Lys 
255 260 265 

Lys Lys Pro Leu Cys Leu Gin Arg Glu Ala Lys Val Pro His Leu Pro 
270 275 280 

Ala Asp Lys Ala Arg Gly Thr Gin Gly Pro Glu Gin Gin His Leu Leu 
285 290 295 

lie Thr Ala Pro Ser Ser Ser Ser Ser Ser Leu Glu Ser Ser Ala Ser 
300 305 310 

Ala Leu Asp Arg Arg Ala Pro Thr Arg Asn Gin Pro Gin Ala Pro Gly 
315 320 325 330 

Val Glu Ala Ser Gly Ala Gly Glu Ala Arg Ala Ser Thr Gly Ser Ser 
335 340 345 

Asp Ser Ser Pro Gly Gly His Gly Thr Gin Val Asn Val Thr Cys He 
350 355 360 

Val Asn Val Cys Ser Ser Ser Asp His Ser Ser Gin Cys Ser Ser Gin 
365 370 375 

Ala Ser Ser Thr Met Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro 
380 365 390 

Lys Asp Glu Gin Val Pro Phe Ser Lys Glu Glu Cys Ala Phe Arg Ser 
355 400 405 410 

Gin Leu Glu Thr Pro Glu Thr Leu Leu Gly Ser Thr Glu Glu Lys Pro 
415 420 425 

Leu Pro Leu Gly Val Pro Asp Ala Gly Met Lys Pro Ser 
430 435 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1368 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to inRNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: HOMO SAPIENS 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: laiTibda-gtl0-7ctnfbp 
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(ix) FEATURE: 

(A) NAME/KEY: OS 

(B) LOCATION: 1. .1366 

(ix) FEATURE; 

(A) NAME/ KEY: ir^t_peptide 

(r) LOCATION: 121.. 1363 

(ix) FEATURE: 

(A) NAME/KEY: sig_pepride 

(B) LOCATION: l.'l20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG GGC CTC TCC ACC GTG OCT GAC CTG CTG CTG CCG CTG GTG CTC CTG 4 6 

Met Gly Leu Ser Thr Val Pro Aso Leu Leu Leu Pro Leu Vai Leu Leu 

-35 ' -30 -25 

GAG CTG TTG GTG GGA ATA TAC CCC TCA GGG GTT ATT GGA CTG GTC CCT 96 

Giu Leu Leu Val Gly He Tyr Pro Ser Gly Val He Gly Leu Val Pro 

-20 -15 " -10 

CAC CTA GGG GAC AGG GAG AAG AGA GAT AGT GTG TGT CCC CAA GGA AAA 144 

His Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Glv Lys 

-5 1 5 ^ 

TAT ATC CAC CCT CAA AAT AAT TCG ATT TGC TGT ACC AAG TGC CAC AAA 192 

Tyr He His Pro Gin Asn Asn Ser He Cys Cys Thr Lvs Cvs His Lvs 

IC 15 20 ' ' ' 

GGA ACC TAC TTG TAC AAT GAC TGT CCA GGC CCG GGG CAG GAT ACG GAC 24 C 

Gly Thr Tyr Leu Tyr Asn Aso Cvs Pro Glv Pro Gly Gin Ksv Thr Aso 

25 30 ' 35 * 40 

TGC AGG GAG TGT GAG AGO GGC TCC TTC ACC GOT TCA GAA AAC CAC CTC 2 86 

Cys Arg Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu 

45 50 £5 



AGA CAC TGC CTC AGC TGC TCC AAA TGC CGA AAG GAA ATG GGT CAG GTG 
Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Glv Gin Val 
60 65 ' " 70 



GAG ATC TOT TCT TGC ACA GTG GAC CGG GAC ACC GTG TGT GGC TGC AGG 5 64 

Glu He Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg 
75 60 85 ^ " ' 



A.nG A.ii.C CAG TAC CGG CAT TAT TGG AGT GAA AAC CTT TTC CAG TGC TTC 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cvs Phe 

AAT TGC AGC CTC TGC CTC r_-.T GGG ACC GTG CAC CTC TCC TGC CAG GAG 

Asn Cys Ser Leu Cvs Leu Asn Glv Thr Val His Leu Ser Cvs Gl- G^ u 

HC 115 ■ 12C 



-.-C GTG TGC ACC TGC CAT GCA GGT TTC TTT CTA AGA GA_A 
rhr Val Cys Thr Cvs His Ala Glv Phe Phe L-^u A-a G'" 
125 130 " 135 



4 32 
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AAC GAG TGT GTC TC^GT AGT AAC TGT AAG AAA AGC CTG CTG TGC ACG 576 
Asn Glu Cys Val Ser Cys Ser Asn Cys Lys Lys Ser Leu Glu Cys Thr 
140 145 * 150 

AAG TTG TGC CTA CCC GAG ATT GAG AAT GTT AAG GGC ACT GAG GAC TCA G24 
tLys Leu Cys Leu Pro Glr. lie Glu Asn Val Lys Gly Thr Glu Asp Ser 
155 160 165 

GGC ACC ACA GTG CTG TTG CCC CTG GTC ATT TTC TTT GGT CTT TGC CTT 672 
Gly Thr Thr Val Leu Leu Pro Leu Val lie Phe Phe Gly Leu Cys Leu 
170 . 175 180 

TTA TCC CTC CTC TTC ATT GGT TTA ATG TAT CGC TAG CAA CGG TGG AAG '720 
Leu Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 
185 190 195 200 

TCC AAG CTC TAC TCC ATT GTT TGT GGG AAA TCG ACA CCT GAA AAA GAG 768 
Ser Lys Leu Tyr Ser He Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
205 210 215 

GGG GAG CTT GAA GGA ACT ACT ACT AAG CCC CTG GCC CCA AAC CCA AGC 816 
Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Se- 
220 225 230 



TTC AGT CCC ACT CCA GGC TTC ACC CCC ACC CTG GGC TTC AGT CCC GTG 
Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe Ser Pro Val 
235 240 245 



864 



CCC AGT TCC ACC TTC ACC TCC AGC TCC ACC TAT ACC CCC GGT GAC TGT 912 
Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr Pro Glv Aso Cvs 
250 255 260 - - - 



960 



1008 



CCC AAC TTT GCG GCT CCC CC-C AGA GAG GTG GCA CCA CCC TAT CAG GGG 
Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tv- Gin Glv 

270 275 " ' 280 

GCT GAC CCC ATC CTT GCG ACA GCC CTC GCC TCC GAC CCC ATC CCC AAC 
Ala Asp Pro He Leu Ala Thr Ala Leu Ala Ser Asp Pro lie Pro Psv 
285 290 295 

CCC CTT CAG AAG TGG GAG GAC AGC GCC CAC AAG CCA CAG AGC CTA Gi^C 1056 
Pro Leu Gin Lys Trp Glu Asp Ser Ala His Lvs Pro Gin Ser Leu Aso 
300 305 ■ 310 

ACT GAT GAC CCC GCG ACG CTG TAC GCC GTG GTG GAG AAC GTG CCC CCG liO^ 
Thr Asp Asp Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Vat ^-o ^ro 
315 320 325 

TTG CGC TGG AAG GAA TTC GTG CGG CGC CTA GGG CTG AGC GAC CAC GAG n =;2 

Leu Arg Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Aso His Glu 
330 335 340 



1200 



ATC GAT CGG CTG GAG CTG CAG AAC GGG CGC TGC CTG CGC GAG C-C"- CA.^^ 

lie As? Arg Leu Glu Leu Gin Asn Glv Arg Cvs Leu Arg G-u ^'-l rz': S 

350 - 355 ' Ho 

TAC AGC ATG CTG GCG ACC TGG AC-G CGG CGC ACG CCG CGG CGC G-G GC^ "2' = 

Tyr Ser Mer Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Ara Glu ^-'l ' " 

3^5 370 375 
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3C GTG CTC CGC GAC ATG GAC CTG^K 



ACG CTG GAG CTG CTG GaUEGC GTG CTC CGC GAC ATG GAC CTG^IK GGC 129c 
Thr I»ei: Glu Leu ^eu Gly Arg Val Leu Arg Asp Met Asp leu Leu Gly 
38C 385 39C 

TGC CTG GAG GAC ATC GAG GAG GCG CTT TGC GGC CCC GCC GCC CTC CCG 1^4 4 

Cys Leu Glu Asp lie Glu Glu Ala Leu Cys Gly Pro Ala Ala Leu Pro 
355 ' 4C0 " ' 4C5 

CCC GCG CCC AGT CTT CTC AGA TGA 13 £r 

Pro Ala Pro Ser Leu Leu Arc 
410 415 



(2) INTORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHAKACTERISTICS : 

(A) LENGTH: 455 ainir.c acids 
C3) TYPE: a.TjLnc acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQL^NCE DESCRIPTION: SEQ ID NO : 4 : 

Met. Gly leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Leu 
-40 -35 -30 -25 

Glu Leu Leu Val Gly He Tyr Pro Ser Gly Val He Gly Leu Val Pro 
-20 ' -15 -10 

His Leu Gly Asp Arg Glu Lys A.rg Asp Ser Val Cys Pro Gin Gly Lys 



Tyr He His Pro Gin Asn Asn Ser He Cys Cys Thr Lys Cys His Lys 

10 15 " ' 20 " 

Gly Thr Tyr Leu Tyr Asn Asp Cys Pro Glv Pro Giv Gin Aso Thr Asc 

25 30 ' 35 " * 40 

Cys Arg Glu Cys Glu Ser Gly Ser ?he Thr Ala Ser Glu Asn His Leu 
45 5C 55 

Arg His Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Vai 

6C ' 65 ' " 70 

Glu He Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cvs Arc 

75 SO " * 65 " " " 

Lys Asn Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe 

9C 95 IOC 

Asn Cys Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu 

1C5 lie " 115 ' 120 

lys Gin AST: Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu 

Asn Glu Cys Val Ser Cys Ser Asn Cys lys lys Ser leu Glu Cvs Tr.r 
14: 145 ' 15C 
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Lys Leu Cys Leu Pro Gin lie Glu Asn Val Lys Giy Thr Glu Asp Ser 
155 160 165 

Gly Thr Thr Val Leu Leu Pro Leu Val He Phe Phe Gly Leu Cys Leu 
^'70 175 180 

Leu Ser Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys 
185 190 200 

Ser Lys Leu Tyr Ser He Val Cys Gly Lys Ser Thr Pro Glu Lys Glu 
205 210 215 

Gly Glu Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser 
220 225 230 

Phe Ser Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe Ser Pro Val 
235 240 245 

Pro Ser Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr Pro Gly Asp Cvs 
250 255 260 ^ - ^ 

Pro Asn Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tyr Gin Gly 

270 275 280 

Ala Asp Pro He Leu Ala Thr Ala Leu Ala Ser Asp Pro He Pro Asn 
285 290 295 

Pro Leu Gin Lys Trp Glu Asp Ser Ala His Lys Pro Gin Ser Leu Asp 
300 305 310 

Thr Asp Asp Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Val Pro Pro 
315 320 325 

Leu Arg Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Asp His Glu 
^30 335 340 

He Asp Arg Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin 

350 355 360 

Tyr Ser Met Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Arg Glu Ala 
365 370 375 

Thr Leu Glu Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu Gly 
380 385 390 

Cys Leu Glu Asp He Glu Glu Ala Leu Cys Gly Pro Ala Ala Leu Pro 
395 400 405 

Pro Ala Pro Ser Leu Leu Arg 
410 415 

{2} INFORMATION FOR SEQ ID NO: 5: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3011 base pairs 
CB) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: cDNA to niRNA 
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(iii) Hy?C?KETICAL: N*0 

(iv) ANTI-SENSE: NO 

(vi; ORIGINAL SOURCE: 

( A ) C R3 AN ISM: Home sapiens 

(vii; IMMEriArE SOURCE: 

(3) CLONE: HUIL-IR 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(3) LOCATION: 64 . .1793 

(ix) FEATURE: 

(A) NAME/KEY: mat_pepride 
(3) LOCATION: 14 4.. 17 90 

(ix! FEATUP^: 

(A) NAME/KEY: sig^peptide 
(3) LOCATION: 64.. 143 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

A3ACGCACCC TCTGAAGATG GTGGACTCOC TOCTGAGAAG CTGGGACCCC TT3GTAAAAG 60 

ACAAGGCCTT CTCCAAGAAG AAT ATG AAA GTG TTA CTC AGA CTT ATT TGT 11 C 

Met Lvs Val Leu Leu Arc Leu He Cys 
-2C -15 

TTC ATA GOT CTA CTG ATT TCT TCT CTG GAG GCT GAT AAA TGC AAG GAA 15 8 

?he He Ala Leu Leu lie Ser Ser Leu Glu Ala Ksu Lys Cvs Lys Glu 
-10 .-5 I ^ ' 5 

CGT Gr^h GAA AAA ATA ATT TTA GTG TCA TCT GCA AAT GAA ATT GAT GTT 20 6 

Arg Glu Glu Lys He He Leu Val Ser Ser Ala Asn Glu He Asp Val 
10 15 2C 

CGT CCC TGT CCT CTT AAC CCA AAT GkA CAC AAA GGC ACT ATA ACT TG3 254 
Arg Pro Cys Pro Leu Asn Pro Asn Glu His Lys Gly Thr He Thr Trp 
25 30 ' ^ 35 ' 

TAT AA-i GAT GAC AGC AAG ACA CCT GTA TCT AOA GAA CAA GCC TCC AGG 3 02 

Tyr Lys A.sp Asp Ser Lys Thr Pro Val Ser Thr Glu Gin Ala Ser Arg 
40 45 50 

ATT CAT CAA CAC AAA GAG A-AA CTT TGG TTT GTT CCT GCT AAG GTG GAG 35 0 

He His Gin His Lys Glu Lys Leu Trp Phe Val Pro Ala Lvs Val Glu 
55 60 * 65 

GAT TCA GGA CAT TAC TAT TGC GTG GTA AGA AAT TCA TCT TAC TGC CTC 39a 
Asp Ser Gly His Tyr Tyr Cys Val Val Arg Asn Ser Ser Tvr Cvs Le-^ 
^ 75 ' ' 60 " " 65 

AGA ATT A^AA ATA AGT GCA AA„-. TTT GTG GAG rJ^.T GAG CCT AAC TTA TGT 44 6 

A.rg He Lys He Ser A.la Lys Phe Val Glu Asn Glu Pro Asn Leu Cys 
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ATA TTT AAG GAG AAA CTA CCC G'^^Z 



TAT AAT GCA CAA HSC ATA TTT AAG CAG AAA CTA CCC GI^^CA GGA GAC 494 
Tyr Asn Ala Gin Ala lie Phe Lys Gin Lys Leu Pro Val Ala Gly Asp 
105 110 115 



GGA 


GGA 


CTT 


GTG 


TGC 


CCT 


TAT ATG 


GAG 


TTT 




AAA 


AAT 


GAA 


AAT 


AAT 


542 


Gly 


Gly 


Leu 


Val 


Cys 


Pro 


Tyr Met 


Glu 


Phe 


Phe 


Lys 


Asn 


Glu 


Asn 


Asn 








U 


















130 
















^ a T\ 
AAA 


TTA 


CAG 


TGG TAT 


AAt? 


GAT 


TGC 


AAA 


CCT 


CTA 


CTT 


CTT 


590 


Glu 


Leu 


Pro 


Lys 


Leu 


Gin 


Trp Tyr 


Lys 


Asp 


Cys 


Lys 


Pro 


Leu 


Leu 


Leu 






135 










140 








145 












GAC 


AAT 


ATA 


CAC 


TTT 


AGT 


GGA GTC 


AAA 


GAT 


AGG 


CTC 


ATC 


GTG 


ATG 


AAT 


638 


Asp 


Asn 


He 


His 


Phe 


Ser 


Gly Val 


Lys 


Asp 


Arg 


Leu 


He 


Val 


Met 


Asn 




150 










155 








160 










165 




GTG 


GCT 


GAA 


AAG 


CAT 


AGA 


GGG AAC 


TAT 


ACT 


TGT 


CAT 


GCA 


TCC 


TAC 


ACA 


686 


Val 


Ala 


Glu 


Lys 


His 


Arg Gly Asn 


Tyr 


Thr 


Cys 


His 


Ala 


Ser 


Tyr 


Thr 












170 








175 










180 







TAC 
Tyr 


TTG 
Leu 


GGC 
Gly 


AAG 
Lys 
185 


CAA 
Gin 


TAT 
Tyr 


CCT 
Pro 


ATT 
He 


ACC 
Thr 
190 


CGG 
Arg 


GTA 
Val 


ATA 
He 


GAA 
Glu 


TTT 
Phe 
195 


ATT 
He 


ACT 
Thr 


734 


CTA 
Leu 


GAG 
Glu 


GAA 
Glu 
200 


AAC 
Asn 


AAA 
Lys 


CCC 
Pro 


ACA AGG CCT GTG 
Thr Arg Pro Val 
205 


ATT 
He 


GTG 
Val 


AGC 
Ser 
210 


CCA 
Pro 


GCT 
Ala 


AAT 
Asn 


782 


GAG 
Glu 


ACA 
Thr 
215 


ATG 
Met 


GAA 
Glu 


GTA 
Val 


GAC 
Asp 


TTG GGA 
Leu Gly 
220 


TCC 
Ser 


CAG 
Gin 


ATA 
He 


CAA 
Gin 
225 


TTG 
Leu 


ATC 
He 


TGT 
Cys 


AAT 
Asn 


830 


GTC 
Val 
230 


ACC 
Thr 


GGC CAG 
Gly Gin 


TTG 
Leu 


AGT GAC 
Ser Asp 
235 


ATT 
He 


GCT 
Ala 


TAC 
Tyr 


TGG 
Trp 
240 


AAG 
Lys 


Trp 


xiAi 

Asn 


(Zfzrz 
Gly 


TP'S 

Ser 
245 


0/0 


GTA 
Val 


ATT 
He 


GAT 
Asp 


GAA 
Glu 


GAT 
Asp 
250 


GAC 
Asp 


CCA 
Pro 


GTG 
Val 


CTA 
Leu 


GGG 
Gly 
255 


GAA GAC 
Glu Asp 


TAT 
Tyr 


TAC 
Tyr 


AGT 
Ser 
260 


GTG 
Val 


926 


GAA 
Glu 


AAT 
Asn 


CCT 
Pro 


GCA 
Ala 
265 


AAC 
Asn 


AAA 

Lys 


AGA 
Arg 


AGG 
Arg 


AGT 
Ser 
270 


ACC 
Thr 


CTC 
Leu 


ATC 
He 


ACA 
Thr 


GTG 
Val 
275 


CTT 
Leu 


AAT 
Asn 


97 4 


ATA 
He 


TCG 
Ser 


GAA 
Glu 
280 


ATT 
He 


GAA 
Glu 


AGT 
Ser 


AGA 
Arg 


TTT 
Phe 
285 


TAT 
Tyr 


AAA 
Lys 


CAT 
His 


CCA 
Pro 


Phe 
290 


ACC 
Thr 


TGT 
Cys 


TTT 
Phe 


1022 


GCC 
Ala 


AAG 
Lys 
295 


AAT 
Asn 


ACA 
Thr 


C^.T 
His 


GGT 
Gly 


ATA 
He 
300 


GAT 
Asp 


GCA 
Ala 


GCA 
Ala 


TAT 
Tyr 


ATC 
He 
305 


CAG 
Gin 


TTA 
Leu 


ATA 
He 


TAT 
Tyr 


107C 


CCA 
Pro 
310 


GTC 
Val 


ACT 
Thr 


AAT 
Asn 


TTC 
Phe 


CAG 
Gin 
315 


AAG 
Lys 


CAC 
His 


ATG 
Met 


ATT 
He 


GGT 
Gly 
320 


ATA 
He 


TGT 
Cys 


GTC 
Val 


ACG 
Thr 


TTG 
Leu 
325 


1113 


ACA 
Thr 


GTC 
Val 


ATA 
He 


ATT 
He 


GTG 
Val 
330 


TGT 
Cys 


Ser 


GTT 
Val 


TTC 
Phe 


ATC 
He 
335 


TAT 
Tyr 


AAA 
Lys 


ATC 
He 


TTC 
Phe 


AAG 
Lys 
34 0 


ATT 
He 


1166 
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ATT GTG CTT TGG TAG AG^G GAT TCC TGC TAT GAT TTT CTC CCA ATA 121 4 

Asp lie Val Leu Trp Tyr Arg Asp 5er Cys Tyr Asd Phe Lei: Pre lie 
345 35C 355 

AAA GCT TCA GAT GGA AAG ACC TAT GAC GCA TAT ATA CTG TAT CCA AAG 12 52 

Lys Ala Ser Asp Gly Lys Thr Tyr Asp Ala Tyr lie Leu Tvr Prr lys 

ACT GTT GGG GAA GGG TCT ACC TCT GAC TGT GAT ATT TTT GTG TTT AAA 131 C 

Tr.r Val Gly Glu Gly Ser Thr Ser Asp Cys Asp lie Phe Val Phe Lvs 
375 380 " 385 

GTC TTG CCT GAG GTC TTG GAA AAA CAG TGT GGA TAT AAG CTG TTC ATT 1356 
Val Leu Pre Glu Val Leu Glu Lys Gin Cys Glv Tvr Lvs Leu Phe lie 

395 " 400 ^ ^ 405 

TAT GGA AGG GAT GAC TAG GTT GGG GAA GAC ATT GTT GAG GTC ATT AAT 140 6 

Tyr Gly Arg Asp Asp Tyr Val Gly Glu Asp lie Val Glu Val lie Asn 
410 415 420 

GAA AAC GTA AAG AAA AGC AGA AGA CTG ATT ATC ATT TTA GTC AGA GAA 14 54 

Glu Asn Val Lys Lys Ser Arg Arg Leu lie lie lie Leu Val Arg Glu 
425 430 435 

ACA TCA GGC TTC AGC TGG CTG GGT GGT TCA TCT GAA GAG CAA ATA GCC 1502 
Thr Ser Gly Phe Ser Tr? Leu Gly Gly Ser Ser Glu Glu Glr. lie Ala 
440 445 450 

ATG TAT AAT GCT CTT GTT CAG GAT GGA ATT AAA GTT GTC CTG CTT GAG 155 0 

Met Tyr Asn Ala Leu Val Gin Asp Gly lie Lvs Val Val Leu Leu Glu 
455 460 465 

CTG GAG AAA ATC CA.^^. GAC TAT GAG AAA ATG CCA GAA TCG ATT AAA TTC 1596 
Leu Glu Lys He Gin Asp Tyr Glu Lvs Met Pro Glu Ser He Lvs ^he 
^"^0 475 480 ' 485 

ATT AAG CAG AAA CAT GGG GCT ATC CGC TGG TCA GGG GAC TTT ACA CAG 164 6 

He Lys Gin Lys His Gly Ala He Arg Tro Ser Gly Asp Phe Thr G^ 
^90 495 ■ " ioc 

GGA CCA CAG TCT GCA AAG ACA AGG TTC TGG AAG AAT GTC AGG TAC CAC 169^ 
Gly Pre Gin Ser Ala Lys Thr Arg Phe Trc Lvs Asn Val Ara Tyr s 
505 510 ' 515 

ATG CCA GTC CAG CGA CGG TCA CCT TCA TCT AAA CAC CAG TTA CTG TCA 17 42 

Met Pre Val Gin Arg Arg Ser Pro Ser Ser Lys Kis Gin Leu Leu Ser 

53C 



d2o 



CCA GCC ACT AAG GAG AAJi CTG CAA AGA GAG GCT CAC GTG CCT CTC GGG 17 90 

Pro Ala Thr Lys Glu Lys Leu Gin Arg Glu Ala His Val Pro Le- v 
535 540 545 ^ '* 

TAGCATGGAG A^AGTTGCCP-^. GAGTTCTTTA GGTGCCTCCT GTCTTATGGC GZTGZkGGZZ ' S' C 
AGGTTATG3C TCATGCTGAC TTGCAGAGTT CATGGAATGT 



^ - '^rt.'^o * C* ^joAATCAG A.TTATTAAGG 



- r^-^^^r. * or-. „ o * wi-_r. - A .jrCAGC CCAG 



uuw.-.CTTCAG A3TAGAGGGC TTGGGAAGAT CTTTT AAAAA 
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GCACTTTGGG AGGCTGAAGT GGGTGGATCA CCAGAGGTCA 20 90 

GGAGTTCGAG ACCAGCCCAG CCAACATGGC AAAACCCCAT CTCTACTAAA AATACAAAAA 2150 

TGAGCTAGGC ATGGTGGCAC ACGCCTGTAA TCCCAGCTAC ACCTGAGGCT GAGGCAGGAG 2210 

AATTGCTTGA ACCGGGGAGA CGGAGGTTGC AGTGAGCCGA GTTTGGGCCA CTGCACTCTA 227 0 

GCCTGGCAAC AGAGCAAGAC TCCGTCTCAA AAAAAGGGCA ATAAATGCCC TCTCTGAATG 2330 

TTTGAACTGC CAAGAAAAGG CATGGAGACA GCGAACTAGA AGAAAGGGCA AGAAGGAAAT 2390 

AGCCACCGTC TACAGATGGC TTAGTTAAGT CATCCACAGC CCAAGGGCGG CGGCTATGCC 2450 

TTGTCTGGGG ACCCTGTAGA GTCACTGACC CTGGAGCGGC TCTCCTGAGA GGTGCTGCAG 2510 

GCAAAGTGAG ACTGACACCT CACTGAGGAA GGGAGACATA TTCTTGGAGA ACTTTCCATC 257 0 

TGCTTGTATT TTCCATACAC ATCCCCAGCC AGAAGTTAGT GTCCGAAGAA GAGCTTGAAA 2 630 

ACTCACTTCA ATGAACAAAG GGATTCTCCA GGATTCCAAA GTTTTGAAGT CATCTTAGCT 2690 

TTCCACAGGA GGGAGAGAAC TTAAAAAAGC AACAGTAGCA GGGAATTGAT CCACTTCTTA 2750 

ATGCTTTCCT CCCTGGCATG ACCATCCTGT CCTTTGTTAT TATCCTGCAT TTTACGTCTT 2 810 

TGGAGGAACA GCTCCCTAGT GGCTTCCTCC GTCTGCAATG TCCCTTGCAC AGCCCACACA 2 87 0 

TGAACCATCC TTCCCATGAT GCCGCTCTTC TGTCATCCCG CTCCTGCTGA AACACCTCCC 2930 

AGGGGCTCCA CCTGTTCAGG AGCTGAAGCC CATGCTTTCC CACCAGCATG TCACTCCCAG 2990 
ACCACCTCCC TGCCCTGTCC T 3 OH 

(2) INFORMATION FOR SEQ XD NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56S amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6; 

Met.Lys Val Leu Leu Arg Leu He Cvs Phe He Ala Leu Leu 11^ Se^ 
"20 -15 -10 -5 

Ser Leu Glu Ala Asp Lys Cys Lys Glu Arg Glu Glu Lys He He Leu 
15 10 

Val Ser Ser Ala Asn Glu He Asp Val Arg Pro Cys Pro Leu Asn Pro 
15 20 25 

Asn Glu His Lys Gly Thr He Thr Trr? Tvr Lvs Aso Asd Ser Ly«^ '^^.^^r 

30 35 * ' " 40 * ' 

Pro Val Ser Thr Glu Gin Ala Ser Arg He His Gin His Lvs Glu Lvs 

50 55 ' 60 
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Leu Trp Phe Val Pre hla Lvs Wal Glu Asd Ser Glv His Tvr Tyr Cvs 
65 7C ' ' 75 

Val Val Arc Asn Ser Ser Tyr Cys Le-j Arg lie lvs lie Ser Ala Lvs 
ec 55 ' ' 90 ^ 

Phe Val Glu Asn Glu Pro Asn Leu Cys ?yr Asn Ala Gin Ala lie Phe 
?5 IOC ^ :C5 

Lys Gin Lys Leu Pro Val Ala Gly Asp Glv Gly Leu Val Cvs Pro Tyr 

llD 12C 

Met Glu Phe Phe Lys Asn Glu Asn Asn Glu Leu Pro Lys Leu Gin Trrs 
-25 130 135 140 

Tyr Lys Asp Cys Lys Pre Leu Leu Leu Asi: Asn lie His Phe Ser Gly 
1^5 150 155 

Val Lys Asp Arg Leu lie Val Met Asn Val Ala Glu Lys His Arg Gly 
-60 165 170 

Asn Tyr Thr Cys His Ala Ser Tyr Thr Tyr Leu Gly Lvs Gin Tvr Pro 
^*?5 180 185 

He Thr Arg Val He Glu Phe He Thr Leu Glu Glu Asn Lys Pro Thr 

195 200 

Arg Pro Val He Val Ser Pro Ala Asn Glu Thr Met Glu Val Asp Leu 
205 210 215 220 

Gly Ser Gin He Gin Leu He Cys Asn Val Thr Gly Gin Leu Ser Asp 
225 230 235 

He Ala Tyr Trp Lys Trp Asn Gly Ser Val He Asp Glu Asn Astd Pro 
240 245 250 

Val Leu Gly Glu Asp Tyr Tyr Ser Val Glu Asn Pro Ala Asn Lys Ara 
255 260 265 

Arg Ser Thr Leu He Thr Val Leu Asn He Ser Glu He Glu Ser Arc 
2'70 275 280 

Phe Tyr Lys His Pro Phe Thr Cys Phe Ala Lys Asn Thr HiS Gly He 
285 290 295 300 

Asp Ala Ala Tyr He Gin Leu He Tyr Pro Val Thr Asn Phe Gin Lys 
305 310 315 

His Met He Gly He Cys Val Thr Leu Thr Val He He Val Cys Ser 
220 325 330 

Val Phe He Tyr Lys He Phe Lys He Asp He Val Leu Trp Tyr Arg 
335 540 345 

Asp Ser Cys Tyr Asp Phe Leu Pro He Lvs Ala Ser hso Glv Lys T'r^ 
350 355 ' 560 ^ 

Tyr Asp Ala Tyr He Leu Tyr Pro Lys Thr Val Gly Glu Gly Ser Thr 
3^3 370 ' 375 " ^ ^ 3B0 
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Ser Asp Cys Asp lie Phe Val Phe Lys Val Leu Pro Giu Vai Leu Glu 
385 390 395 

Lys Gin Cys Gly Tyr Lys Leu Phe lie Tyr Gly Arg Asp Asp Tyr Val 
400 405 410 

Gly Glu Asp lie Val Glu Val lie Asn Glu Asn Val Lys Lys Ser Arg 
415 420 425 

Arg Leu He He He Leu Val Arg Glu Thr Ser Gly Phe Ser Trp Leu 
430 435 440 

Gly Gly Ser Ser Glu Glu Gin He Ala Met Tyr Asn Ala Leu Val Gin 

450 455 460 

Asp Gly He Lys Val Val Leu Leu Glu Leu Glu Lys He Gin Asp Tyr 
465 470 475 

Glu Lys Met Pro Glu Ser He Lys Phe He Lys Gin Lys His Gly Ala 
480 485 490 

He Arg Trp Ser Gly Asp Phe Thr Gin Gly Pro Gin Ser Ala Lys Thr 
495 500 505 

Arg Phe Trp Lys Asn Val Arg Tyr His Met Pro Val Gin Arg Arg Ser 
510 515 520 

Pro Ser Ser Lys His Gin Leu Leu Ser Pro Ala Thr Lys Glu Lys Leu 
525 530 535 540 

Gin Arg Glu Ala His Val Pro Leu Gly 
545 



(2) INFORMATION- FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1357 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: . NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: Human B cell lymohoblastoid 

(H) CELL LINE: CB23 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pHuIL-lRIl75 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

CB) LOCATION; 154.. 1350 
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(ix) FEATURE: 

(A) NAME/KEY: rr^t_peptide 

(E) LOCATION: 193.. 1547 

(ix; FEATURE: 

(A) KAME/KEY: sic^peptide 

(H; LOCATION: 154.. 192 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

CTGGAAAATA CATTCTGCTA CTCTTAAAAA CTAGTGACGC TCATACAAAT CAACAGAAAG 60 

AGCTTCTGAA GGAAGACTTT AAAGCTGCTT CTGCCACGTG CTGCTGGGTC TCAGTCCTCC 12 C 

ACTTCCCGTG TCCTCTGGAA GTTGTCAGGA GCA ATG TTG CGC TTG TAC GTG TTG 174 

Met Leu Arc Leu Tyr Val Leu 
-13 ' -10 

GTA ATG GGA GTT TCT GCC TTC ACC CTT CAG COT GCG GCA CAC ACA GGG 222 
Val Met Gly Val Ser Ala Phe Thr Leu Gin Pro Ala Ala His Thr Gly 
-5 15 10 

GCT GCC AGA AGO TGC CGG TTT CGT GGG AGG CAT TAC AAG CGG GAG TTC 27 0 

Ala Ala Arg Ser Cys Arg Phe Arg Gly Arc Kis Tyr Lys Arg Glu Phe 
15 ''26 ' 25 

AGG CTG GAA GGG GAG OCT GTA GCC CTG AGG TGC CCC CAG GTG CCC TAC 318 
Arg Leu Glu Gly Glu Pro Val Ala Leu Arg Cys Pro Gin Val Pre Tyr 
30 35 4C 

TGG TTG TGG GCC TCT GTC AGO CCC CGC ATC AAC CTG ACA TGG CAT AAA 366 
Trp Leu Trp Ala Ser Val Ser Pro Arg lie Asn Leu Thr Trp His Lys 
45 50 55 

AAT GAC TCT GCT AGG ACG GTC CCA GGA GAA GAA GAG ACA CGG ATG TGG 414 
Asn Asp Ser Ala Arg Thr Val Pro Gly Glu Glu Glu Thr Arg Met: Trp 
60 65 " 70 ^ ' 

GCC CAG GAC GGT GCT CTG TGG CTT CTG CCA GCC TTG CAG GAG GAC TCT 4 62 

Ala Gin Asp Gly Ala Leu Trp Leu Leu Pro Ala Leu Gin Glu Asp Ser 
75 80 85 SO 

GGC ACC TAC GTC TGC ACT ACT AGA AAT GCT TCT TAC TGT GAC AAA ATG 510 
Gly Thr Tyr Val Cys Thr Thr Arg Asn Ala Ser Tyr Cys Asp Lys Met 
95 100 105 

TCC ATT GAG CTC AGA GTT TTT GAG AAT ACA GAT GCT TTC CTG CCG TTC 55 £ 

Ser lie Glu Leu Arg Val Phe Glu Asn Thr Asp Ala Phe Leu Pro Phe 
110 115 ' 120 

ATC TCA TAC CCG CAA ATT TTA ACC TTG TCA ACC TCT GGG GTA TTA GTA 6 06 

He Ser Tyr Pro Gin lie Leu Thr Leu Ser Thr Ser Gly Val Leu Val 
125 130 135 

TGC CCT GAC CTG ACT GA^. TTC ACC CGT GAC A«-?i ACT GAC GTG AAG ATT 6 54 

Cys Fro Asp leu Ser Glu Phe Thr Arg Asp Lys Thr Asp Val Lys He 

^ ^ \J -^u 
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CAA TGG TAG AAG GAT TCT CTT CTT TTG GAT AAA GAC AAT GAG AAA TTT 7 02 

Gin Trp Tyr Lys Asp Ser Leu Leu Leu Asp Lys Asp Asn Glu Lys Phe 
3-55 160 165 170 

CTA AGT GTG AGG GGG ACC ACT CAC TTA CTC GTA CAC GAT GTG GCC CTG 750 
Leu Ser Val Arg Gly Thr Thr His Leu Leu Val His Aso Val Ala Leu 
175 ISO * 185 

GAA GAT GCT GGC TAT TAG CGC TGT GTC CTG ACA TTT GCC CAT GAA GGC 7 98 

Glu Asp Ala Gly Tyr Tyr Arg Gys Val Leu Thr Phe Ala His Glu Gly 
190 195 200 

CAG CAA TAG AAC ATC ACT AGG AGT ATT GAG CTA CGC ATC AAG AAA AAA 846 
Gin Gin Tyr Asn lie Thr Arg Ser lie Glu Leu Arg He Lys Lys Lvs 
205 210 215 

AAA GAA GAG ACC ATT GCT GTG ATC ATT TCG CGC CTC AAG ACC ATA TCA 8 94 

Lys Glu Glu Thr He Pro Val He He Ser Pro Leu Lys Thr He Se- 
220 225 230 

GCT TGT CTG GGG TCA AGA CTG ACA ATC CCG TGT AAG GTG TTT CTG GGA 942 
Ala Ser Leu Gly Ser Arg Leu Thr He Pro Cys Lys Val Phe Leu Gly 
235 240 245 250 

ACC GGC ACA CCG TTA ACC ACC ATG CTG TGG TGG AGG GCC AAT GAC ACC 990 
Thr Gly Thr Pro Leu Thr Thr Met Leu Trp Trp Thr Ala Asn Aso Thr 
255 260 265 

CAC ATA GAG AGG GCC TAG GGG GGA GGC GGC GTG ACC GAG GGG CCA GGC 1038 
His He Glu Ser Ala Tyr Pro Gly Gly Arg Val Thr Glu Glv Pro A-g 
270 275 280 

CAG GAA TAT TCA GAA AAT AAT GAG AAC TAG ATT GAA GTG GGA TTG ATT 10 8 6 

Gin Glu Tyr Ser Glu Asn Asn Glu Asn Tyr He Glu Val Pro Leu He 
285 290 295 

TTT GAT GCT GTC ACA AGA GAG GAT TTG CAC ATG GAT TTT AAA TGT GTT 1134 
Phe Asp Pro Val Thr Arg Glu Asp Leu His Met Aso Phe Lvs Cys Val 
300 305 3ib 

GTC CAT AAT ACC CTG AGT TTT CAG ACA CTA GGC ACC ACA GTC AAG GAA 1182 
Val His Asn Thr Leu Ser Phe Gin Thr Leu Arg Thr Thr Vat Lvs Glu 

320 325 ' 330 

GCC TGG TGG ACG TTG TGG TGG GGC ATT GTG CTG GCC CCA GTT TCA CTG 1230 
Ala Ser Ser Thr Phe Ser Trp Gly He Val Leu Ala Pro Leu Ser Leu 
335 340 345 

GCC TTG TTG GTT TTG GGG GGA ATA TGG ATG CAC AGA GGG TGG AAA CAC i 278 

Ala Phe Leu Val Leu Gly Gly He Trp Met His Arg Arg Gys Lvs His 
350 355 360 

AGA AGT GGA AAA GCA GAT GGT CTG ACT GTG GTA TGG GCT GAT GA.T CAA 1325 
Arg Thr Gly Lys Ala Asp Gly Leu Thr Val Leu Tro Pro His His G"n 
365 370 ' 375 

GAC TTT GA-A TCG TAT GCC AAG TGAAATAAAT 
Asp Phe Gin Ser Tvr Pro Lvs 
380 3*85 



SUBSTITUTE SHEET 
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(2) INFORMATION FOR SEQ 10' NO : 6 : 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39£ arr^r.c acids 
(3) TYPE: Ejvlno ac^ci 
(C) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE CESCRIPTION: SEQ ID NO: 6: 

Met Leu Arg Leu Tyr VaL Leu Val Met Gly Val Ser Ala Phe Thr Leu 
-13 -10 -5 1 

Gin Pro Ala Ala His Thr Gly Ala Ala Arg Ser Cys Arc Phe Arg Glv 
^ 10 15 " " 

Arg His Tyr Lys Arg Glu Phe Arg Leu Glu Gly Glu Pro Val Ala Leu 
20 25 30 35 

Arg Cys Pro Gin Val Pre Tyr Trp Leu Trp Ala Ser Val Ser Pro Arg 
^2 45 50 

He Asn Leu Thr Trp His Lys Asn Asp Ser Ala Arg Thr Val Pro Gly 
55 6C 65 

Glu Glu Glu Thr Arg Met Tro Ala Gin Asp Glv Ala Leu Tr^ Leu Leu 
70 75 " 80 * 

Pro Ala Leu Gin Glu Asp Ser Gly Thr Tyr Val Cvs Thr Thr Arg Asn 
85 9C 95 

Ala Ser Tyr Cys Asp Lys Met Ser He Glu Leu Ara Val Phe Glu Asn 
^00 105 110 ^ 115 

Thr Asp Ala Phe Leu Pro Phe He Ser Tvr Pro Glr He Leu Thr Leu 
120 125 130 

Ser Thr Ser Gly Val Leu Val Cys Pro Asp Leu Ser Glu Phe Thr Arc 
135 140 ' 145 

Asp Lys Thr Asp Val Lys He Gin Trp Tyr Lys Asp Ser Leu Leu Leu 
-50 155 160 

Asp Lys Asp Asn Glu Lys Phe Leu Ser Val Arg Gly Thr Thr His Leu 
165 170 175 

Leu Val His Asp Val Ala Leu Glu Asp Ala Glv Tyr Tyr Arg Cvs Val 
180 165 190 ' 195 

Leu Thr Phe Ala His Glu Gly Gin Gin Tyr Asn He Thr Ara Ser He 
200 205 ^ 210 

Glu Leu Arg lie Lys Lys Lys Lys Glu Glu Thr He Pro Val He He 
215 220 225 

Ser Pre Leu Lys Thr He Ser Ala Ser Leu Glv Ser Arc L^u Tr - 
230 225 " 240 
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Pro Cys Lys Val Phe Leu Gly Thr Giy Thr Pro Leu Thr Thr Met Leu 

245 250 255 

Trp Trp Thr Ala Asn Asp Thr His lie Glu Ser Ala Tyr Pro Gly Gly 

265 270 275 

Arg Val Thr Glu Gly Pro Arg Gin Glu Tyr Ser Glu Asn Asn Glu Asn 
280 285 290 

Tyr He Glu Val Pro Leu He Phe Asp Pro Val Thr Arg Glu Asp Leu 
295 300 305 

His Met Asp Phe Lys Cys Val Val His Asn Thr Leu Ser Phe Gin Thr 
310 315 320 

Leu Arg Thr Thr Val Lys Glu Ala Ser Ser Thr Phe Ser Trp Gly He 
325 330 335 

Val Leu Ala Pro Leu Ser Leu Ala Phe Leu Val Leu Gly Gly He Trp 

345 350 355 

Met His Arg Arg Cys Lys His Arg Thr Gly Lys Ala Asp Gly Leu Thr 
360 365 370 

Val Leu Trp Pro His His Gin Asp Phe Gin Ser Tyr Pro Lys 
3*75 380 385 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
CO) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9; 
CTGAAGGGAG CACTGGCGAC TAAGGATCCA 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GATCTGTAAC GTGGTGGCCA TCCCTGGGAA TGCAAGCATG GATGC 45 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 base pairs 
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(B) TYPE: nucleic acid 
(C; STRArOEDNESS : single 
{Z) TOPOLOGY: lir.ear 



(xi) SEQUENCE OHSCRIPT lOK : SEQ IE' \'C:11: 

AGTCTG0AC3 TCCACGrCOC CCACCC33TG AGC 

(2) IKEORMAriOK FOR SEQ 10 K0:12: 

(i; SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GATCTGTTGA GO 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESE: single 
(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 3 : 
CTGA^z-^ACATC AGACGTGGTG TGCAAGCCCT GTTAAA 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 : 
GCGTCGACCT AGTGACGCTC ATACAAATC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQL^ENCE CHARACTERISTICS: 
(A) LENGTH: 33 base Dairs 
(P) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGCGGCCGC TCAGGAGGAG GCXTCCTTGA CTG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to inRNA 
(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 65 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

CT GAA GGG AGC ACT GGC GAC GGT GGC GGT GGA TCC GGC GGT GGC GGC 
Glu Gly Ser Thr Gly Asp Gly Gly Gly Gly Ser Gly Gly Gly Gly 



GGC TCA TTG CCC GCC CAG G 
Gly Ser Leu Pro Ala Gin 
20 



(2) INFORMATION FOR SEQ ID NO; 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Glu Gly Ser Thr Gly Asp Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 



1 



5 



10 



15 



1 



5 



10 



15 



Ser Leu Pro Ala Gin 
20 



(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEC'JEKCE DESCRIPTION*; SEQ IC KC : 1 S : 

A.CCGAGGGAC CTGA3CG 17 

(2) INTGPJ^^ATICX FOR SEQ IC NC : 1 9 : 

(i) SEQUENCE CHARA.CCERISTICS : 
(A; LENGTH: 55 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPCLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

TGAGCCGCC3 CCACCGCCGG ATCCACCGCC ACCAGTGACT GGATATATTA ACT 5 3 

(2) INFORMATION FOR SEQ ID N0:2C: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 51 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDK^SS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGTGGCGGTG GATCCGGCGG TGGCGGCGGC TCATTGCCCG CCCAGGTGGC A 51 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCTTGCTCAG CGCGCAGT i q 
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CLAIMS 

What is claimed is: 



1. A fusion protein comprising a human tumor necrosis factor receptor (TNF- 
R) and a human interleukin-1 receptor (IL-IR). 

2. A fusion protein according to claim 1, comprising two TNF-Rs and one IL- 
IR, wherein said fusion protein is of a formula selected from: 



TKF-R - linker - TNF-R - linker - IL-IR and 
15 IL-IR - linker - TNF-R - linker - TNF-R • 

wherein each linker is a peptide linker. 

3. A fusion protein according to claim 2 wherein each of the peptide linkers 
20 comprises from 5 to 100 amino acids selected from the group consisting of glycine, 

asparagine, serine, threonine, and alanine. 

4. A fusion protein according to claim 2 wherein each TNF-R is a soluble 
TNF-R and the IL-IR is a soluble IL-IR. 

25 

5. A fusion protein according to claim 4 wherein each soluble TNF-R 
comprises an amino acid sequence selected from the group consisting of amino acids 1- 
X and -22-x of SEQ ID NO:l, wherein x represents an integer from 163 to 235. 

3^ 6. A fusion protein according to claim 4 wherein the soluble IL-IR comprises 

an amino acid sequence selected from die group consisting of amino acids 1-x and -20- 
X of SEQ ID N0:5, wherein x is 312-316. 

7. A fusion protein according to claim 4 wherein the soluble IL-IR comprises 
35 an amino acid sequence selected from the group consisting of amino acids 1-x and -13- 
X of SEQ ID N0:7, wherein x is 330-333. 
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8. A fusion protein according to claim 1 or 2, 

wherein said TNT-R is encoded by a TNF-R-encoding DNA selected from the 
group consisting of DNA comprising the nucleodde sequence presented in SEQ ID 
NO:l or SEQ JD NO:3, and DNA capable of hybridizing under moderately stringent 
conditions to the nucleotide sequence of SEQ ID NO:l or SEQ ID NO:3, wherein said 
TNF-R-encoding DNA encodes a biologically active TNT-R polypeptide; and 

wherein said IL-IR is encoded by an IL-lR-encoding DNA selected from the 
group consisting of DNA comprising the nucleotide sequence presented in SEQ ID 
N0:5 or SEQ TD N0:7, and DNA capable of hybridizing under moderately stringent 
conditions to the nucleotide sequence of SEQ ED N0:5 or SEQ JD N0:7, wherein said 
IL-lR-encoding DNA encodes a biologically active IL-IR polypeptide. 

9. A fusion protein of the formula: 

TNF-R - linker - TNT-R 
wherein the linker is a peptide linker. 

10. A fusion protein according to claim 9 wherein said peptide linker comprises 
from 5 to 100 amino acids selected from the group consisting of glycine, asparagine, 
serine, threonine, and alanine. 



15 



11. A fusion protein according to claim 9 wherein each TNF-R is a soluble 
25 TNT-R polypeptide. 

12. A fusion protein according to claim 11, wherein each soluble TNF-R 
comprises an amino acid sequence selected from the group consisting of amino acids 1- 
X and -22-x of SEQ ID NO: 1 , wherein x represents an integer from 163 to 235 

30 

13. A fusion protein according to claim 9, 

wherein each TNT-R is encoded by a TNF-R-encoding DNA selected from the 
group consisting of DNA comprising die nucleotide sequence presented in SEQ ID 
NO:l or SEQ ID N0:3, and DNA capable of hybridizing under moderately stringent 
35 conditions to the nucleotide sequence of SEQ ID NO: 1 or SEQ ID N0:3, wherein said 
TNT-R-encoding DNA encodes a biologically active TNT-R pol\peptide. 
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14. An isolated DNA sequence that encodes a fusion protein according to claim 1. 

15. An expression vector comprising a DNA sequence according to claim 14. 

16. An isolated DNA sequence that encodes a fusion protein according to claim 2, 



3 or 4. 

17. An expression vector comprising a DNA sequence according to claim 16. 

18. An isolated DNA sequence that encodes a fusion protein according to claim 9. 

19. An expression vector comprising a DNA sequence according to claim 18. 
15 20. A process for producing a fusion protein of the foimula: 

TNF-R - linker - TNF-R - Unker - IL- 1 R or 
IL-IR - Hnker - TNF-R - linker - TNF-R 

20 

comprising culturing a host cell transformed with an expression vector according to 
claim 17 under conditions that promote expression of said fusion protein, and 
recovering said fusion protein. 

^5 21. A process for producing a fusion protein of the formula TNF-R - 

polypeptide linker - TNF-R comprising culturing a host cell transformed with an 
expression vector according to claim 19 under conditions that promote expression of 
said fusion protein, and recovering said fusion protein. 



30 



22. A pharmaceutical composition comprising a fusion protein according to 
claim 1, 2 or 9 and a suitable diluent, excipient, or carrier. 
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CTG GGCTGC GAoGGC GCG AGGGCGv-lj AGGvjCAGGGt=\jCAACCGGACCCCGCCC'o wATC w i • 

ATG GCG CCC GTC GCC GTC TGG GCC GCG CTG GCC GTC GGA CTG GAG 132 

Mer Ala ?rc Val Ala Val Tro Ala Ala Leu Ala Val Glv Leu Glu -r 

i Bgll ' 

CTC TGG GCT GCG GCG CAC GCC TTG CCC GCC CAG GTG GCA TTT ACA IT" 

Leu Tr? Ala Ala Ala His Ala Pro Ala Gin Val Ala ?he Tr.r = 

CCC 7AC GCC CCG GAG CCC GGG AGC ACA TGC CGG CTC AGA G.-.A TAC 2 22 

Pro Tyr Ala Pro Glu Pro Gly Ser Thr Cys Arg Leu Arc Glu Tyr 23 

TAT GAC CAG ACA GCT CAG ATG TGC TGC AGC hKk TGC TCG CCG GGC 2 5" 

Tyr Asp Gin Thr Ala Gin Met Cys Cys Ser Lys Cys Ser Pro Gly 3 = 



CAA CAT GCA AAA GTC TTC TGT ACC >J^Q ACC TCG GAC ACC GTG TGT 
Gin His Ala Lys Val Phe Cys Thr Lys Thr Ser Asp Thr Val Cys 



CTG TGC GCG CCG CTG CGC AAG TGC CGC CCG GGC TTC GGC GTG GCC 
Leu Cys Ala Pro Leu Arc Lys Cys Arg Pro Gly Phe Gly Val A.la 



GurC GAC TTC GCT CTT CCA GTT GGA CTG AT- GTG Gvj- - o A-.- 'o^. 
G 1 V Asp ^he AiE l.-^.'i v^l Glv ' .^\'. "'^ v^l ^Iv Vgil ^'r -.l: 

^ FIG 2 



GAC TCC TGT GAG GAC AGC ACA TAC ACC CAG CTC TGG >JkZ TGG GTT 2 57 

A.sp Ser Cys Glu As? Ser Thr Tyr Thr Gin Leu Trp Asn Trp Val ££ 

CCC GAG TGC TTG AGC TGT GGC TCC CGC TGT AGC TCT GAC CAG GTG 4 02 

Pro Glu Cys Leu Ser Cys Gly Ser Arg Cys Ser Ser As? Gin Val 53 

GAA. ACT CAA GCC TGC ACT CGG GAA CAG AAC CGC ATC TGC ACC TGC 4 47 

Glu Thr Gin Ala Cvs Thr Ara Glu Gin A.sn Arg lie Cvs Thr Cvs 55 

' ^ Esp I 

AGG CCC GGC TGG TAC TGC GCG CTG AGC AJiG CAG GAG GGG TGC CGG 4 52 

.Arc Pro Gly Trp Tyr Cys Ala Leu Ser Lys Gin Glu Gly Cys A.rc 113 



537 
- * = 



AGA CCA GGA ACT GAA ACA TCA GAC GTG GTG TGC AAG CCC TGT GCC 5 52 
Arc Pro Gly Thr Glu Thr Ser As? Val Val Cys Lys Pro Cys Ala 143 



CCG GGG ACG TTC TCC AAC ACG ACT TCA TCC ACG GAT ATT TGI AGG £ 
Pro Gly Thr Phe Ser A.sn Thr Thr Ser Ser Thr Asp lie Cys A.rg 



- Z 2 



CCC CAC CAG ATC TGT AAC GTG GTG GCC ATC CCT GGG .-.--T GCA AGC £"2 

Pro His Gin lie Cvs Asn Val Val Ala He Pro Glv Asn Ala Ser 1" 

T 

ATG GAT GCA GTC TGC ACG TCC ACG TCC CCC ACC CGG AGT ATG GCC Tl 

Met Asp Ala Val Cys Thr Ser Thr Ser Pro Thr Arc Ser Mei A.la IE 

Cw.-. G\jG GCA GTA CAC T-A CAG C—A G-'o .C*- .-.w.-. Co.-. . — w."-". *■ - 

Pro Gly Ala Val His Leu Pro Gin Pro Val Ser Thr Arc Ser Gin 2: 

CA>- AC'*r CAG CCA A-CT C^A '^jtA.-. CCC AG — AC- G'. — CwA .-. A — - »^ z ^ 

His Thr Gin Pre Thr Pro Glu Pro Ser Thr Ala Pro Ser Thr Ser 21 

4- Pvu II 

T-N- C^G CTC CCA ATG GGC CwC AGC CCC CCA G^ — G.--A G-^^ A-*- .-.C- 
Phe Leu Leu Pro Met Glv Pro Ser Pro Pro Ala Glu Glv Ser Thr 



2 2 
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TT-T GoT CTA CTA ATA ATA GGA GT3 GTG AAZ TGT 2TZ AT3 AT-^ A^* 

^, e 'J ^ - ' g 'J ^ u lie I - e G 1 \' ^ 1 ^ 1 A g r. C v 5^ \' _ Tip M g *: h 263 

GA'j \j: - G AAA AA3 AAG CGC TTG TGG GTG GAG AG A GAA GGC AAc u.'^ ^ ■ 

r - ^- ^ ^ Lys lys Pre le- Cys Le^j Gin Arc Glu Ala Lys Val 2TE 

Cw* uAC TTG CCT GC3 GAT A*AG GGG CGG GGT AC A CAG vjGC CC^. GAu -.^j^ 

Pro His Leu Pro Ala Asp Lys Ala Ar? Gly Thr Gin Gly Pro Glu 2 93 

r"r r-"**- r^^^ > > ^r*^ %^r* ft><^-« T.^.^ i.^'^ ^w^*- — 

w/^O ^J^O '^As^ ^-O L ^ O ^11^ rtwn OL^O /^^JJw -w^ rtu^ jKo^ rto-*- - *0 ' 

Gin Gin His Leu Leu He Thr Ala Pro Ser Ser Ser Ser Ser Ser 3C8 

C-o GAG AGC TGG GCC AGT GGG TTG GAG AGA AGG GGG GG2 ACT CGG 1^^^ 

Leu Glu Ser Ser Ala Ser Ala Leu Asp Arg Arc Ala Pro Thr Arc 323 

AAC CAG CCA CAG GCA CCA GGG GTG GAG GCC AGT GGG GCC GGG GAG _lc / 

Asn Gin Pro Gin Ala Pro Gly Val Glu Ala Ser Gly Ala Gly Glu 335 

GCC CGG GCC AGC ACC GGG AGC TCA GAT TCT TCC CCT GGT GGC CAT 1212 

Ala Arc Ala Ser Thr Gly Ser Ser Asp Ser Ser Pro Gly Gly His 353 

GGG ACC CAG GTC AA.T GTC ACC TGC ATC GTG AAC GTC TGT AGC AGC 1257 

Gly Thr Gin Val Asn Val Thr Cys He Val Asn Val Cys Ser Ser 3 55 

TCT GAC CAC AGC TCA CAG TGC TCC TCC CAA GCC AGC TCC ACA ATG 13C2 

Ser Asp His Ser Ser Gin Cys Ser Ser Gin Ala Ser Ser Thr Met 353 

GGA GAC ACA GAT TCC AGC CCC TCG GAG TCC CCG AAG GAC GAG CAG 134 7 

Gly Asp Thr Asp Ser Ser Pro Ser Glu Ser Pro Lys Asp Glu Gin 395 

GTC CCC TTC TCC AAG GAG GAA TGT GCC TTT CGG TCA CAG CTG GAG 13 92 

Val Pro Phe Ser Lys Glu Glu Cys Ala Phe Arc Ser Gin Leu Glu 413 

ACG CCA GAG ACC CTG CTG GGG AGC ACC GAA GAG AAG CCC CTG CCC 14 37 

Thr Pro Glu Thr Leu Leu Gly Ser Thr Glu Glu Lys Pro Leu Pro 4 25 

CTT GGA GTG CCT GAT GCT GGG ATG AAG CCC AGT -4/0 

Leu Gly Val Pro Ksp Ala Gly Met Lys Pre Ser 439 

TA<A*CCAGG C CGGTGTGGGCT GTGTCGTAGCC AAGGTGvjGCTG AG CC CTG GC AGG ATio Aw. 

CCTGCGAAGGGGCCCTGCTCCTTCCAGGCCCCCACCACTAGGACTCTGAltGCTC - - - CT 
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